ALIGNMENTS



RESULT 1 
ABB81074 

ID ABB81074 standard; protein; 1163 AA. 
XX 

AC ABB81074; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological; analgesic; hepatotropic;

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; receptor. 






XX 

OS Rattus norvegicus. 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-00893348.
XX 

PR 19-MAY-1998; 98IL-00124500.

PR 21-JUL-1998; 98WO-US014715.

PR 22-DEC-1998; 98US-00218277.

PR 19-MAY-1999; 99US-00314161.
XX 

PA (YEDA ) YEDA RES & DEV CO LTD.
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86600. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides.

XX 

PS Example 5; Page 44-47; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS). The method involves administering

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates), Charcot-Marie-Tooth disease, ataxia

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies,

CC adrenomyeloneuropathy, Giant axonal neuropathy, Refsum's disease, Fabry's

CC disease, or lipoproteinemia. The present sequence represents the rat

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 



CC antigen 
XX 

SQ Sequence 1163 AA; 



Query Match 100.0%; Score 5848; DB 5; Length 1163; 

Best Local Similarity 100.0%; Pred. No. 3.9e-297; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Qy  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163



RESULT 2
AAY71310

ID AAY71310 standard; protein; 1163 AA.
XX

AC AAY71310;
XX

DT 02-NOV-2000 (first entry)
XX

DE Rat neurite growth inhibitor Nogo A.
XX

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS;

KW central nervous system; neoplastic disease; antiproliferative; glioma;

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma;

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease;

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment;

KW structural plasticity; screening.

XX

OS Rattus sp.
XX

FH Key Location/Qualifiers

FT Inhibitory-site 1..171

FT /note= "Inhibits NIH 3T3 fibroblast spreading"

FT Modified-site 30

FT /note= "Casein kinase II site"

FT Region 31..58

FT /note= "Acidic region"

FT Region 31..57

FT /note= "Region specifically described in claim 16"

FT Region 172..259

FT /note= "This region is not essential for inhibitory

FT activity"

FT Modified-site 233

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 242..244

FT /note= "Asn is N-glycosylated"

FT Modified-site 291

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 295

FT /note= "Protein kinase C (PKC) site"

FT Misc-difference 404

FT /note= "Encoded by TTG"

FT Modified-site 436

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 468..470

FT /note= "Asn is N-glycosylated"

FT Modified-site 484

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 488

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 502

FT /note= "Casein kinase II site"

FT Inhibitory-site 542..722

FT Modified-site 576

FT /note= "Casein kinase II site"

FT Peptide 623..640

FT /note= "used as immunogen to generate antibody AS 472"

FT Modified-site 626

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 694..696

FT /note= "Asn is N-glycosylated"

FT Modified-site 715

FT /note= "Casein kinase II site"

FT Peptide 762..1163

FT /note= "used as immunogen to generate antibody AS Bruna"

FT Modified-site 784

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 821

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 850

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 855

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 863

FT /note= "Casein kinase II site"

FT Modified-site 868

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 893

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 912..914

FT /note= "Asn is N-glycosylated"

FT Modified-site 925..927

FT /note= "Asn is N-glycosylated"

FT Modified-site 954

FT /note= "PKC and casein kinase II sites"

FT Modified-site 956

FT /note= "PKC and casein kinase II sites"

FT Region 975..1162

FT /note= "This region is not essential for inhibitory

FT activity"

FT Region 976..1163

FT /note= "C-terminal common region found in Nogo A, B and C

FT isoforms"

FT Domain 988..1023

FT /label= Transmembrane_domain

FT /note= "C-terminal hydrophobic region specifically

FT described in claim 16"

FT Modified-site 1024

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 1071..1073

FT /note= "Asn is N-glycosylated"

FT Modified-site 1073

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 1089

FT /note= "Protein kinase C (PKC) site"

FT Domain 1090..1125

FT /label= Transmembrane_domain

FT /note= "C-terminal hydrophobic region specifically

FT described in claim 16"

FT Modified-site 1141..1143

FT /note= "Asn is N-glycosylated"

FT Modified-site 1143

FT /note= "Protein kinase C (PKC) site"
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX

PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 

DR N-PSDB; AAD01173. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 3; Fig 2A; 122pp; English. 
XX 

CC The present sequence is a rat Nogo A protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The protein was 

CC derived from a cDNA generated by fusing R018U37-3, R1-3U21 cDNAs isolated 



CC from hexanucleotides-primed rat brain stem/spinal cord library, and Oli18

CC cDNA from an oligo d(T)-primed rat oligodendrocyte library. Nogo proteins

CC and fragments displaying neurite growth inhibitory activity are used in 

CC the treatment of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer's and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperproliferative or benign

CC dysproliferative disorders e.g. psoriasis and tissue hypertrophy.

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: The present sequence 

CC designated as SEQ ID NO: 2 is stated to be the same as the sequence shown 

CC in Fig. 13 (see AAY71384) of the specification. However, this sequence 

CC does not match the sequence given in Fig. 13. SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1163 AA; 

Query Match 100.0%; Score 5846; DB 3; Length 1163; 

Best Local Similarity 99.9%; Pred. No. 5e-297; 

Matches 1162; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
        |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Db  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 420

Qy  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163





RESULT 3 
AAY71557 
ID AAY71557 standard; protein; 1162 AA.
XX 

AC AAY71557; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A truncated protein used in the construction of mutant Nogo-A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease;

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX

PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS;
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence

CC is a truncated form of rat Nogo A protein shown in AAY71310, which is 

CC used in the construction of mutant Nogo-A. Nogo-A is composed of His- 

CC tag/T7-tag/vector/Nogo-A sequence aa 1-1162. Nogo A deletion mutants were 



CC used for mapping the inhibitory sites of Nogo protein. Major inhibitory 

CC region was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 

CC numbers 
XX 

SQ Sequence 1162 AA; 

Query Match 99.9%; Score 5840; DB 3; Length 1162; 

Best Local Similarity 99.9%; Pred. No. 1e-296;

Matches 1161; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
        |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Db  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 420

Qy  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKA 1162
        ||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKA 1162



RESULT 4 
AAY71384 

ID AAY71384 standard; protein; 1163 AA. 
XX 

AC AAY71384; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Alternative version of rat neurite growth inhibitor Nogo A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS;

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening. 



XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT Inhibitory-site 1..171

FT /note= "Inhibits NIH 3T3 fibroblast spreading"

FT Modified-site 30

FT /note= "Casein kinase II site"

FT Region 31..58

FT /note= "Acidic region"

FT Region 172..259

FT /note= "This region is not essential for inhibitory

FT activity"

FT Misc-difference 223

FT /label= Unknown

FT /note= "There is Leu at this position in the sequence

FT shown in AAY71310"

FT Modified-site 233

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 242..244

FT /note= "Asn is N-glycosylated"

FT Modified-site 291

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 295

FT /note= "Protein kinase C (PKC) site"

FT Misc-difference 404

FT /note= "There is Ile at this position in the sequence

FT shown in AAY71310"

FT Modified-site 436

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 468..470

FT /note= "Asn is N-glycosylated"

FT Misc-difference 469

FT /label= Unknown

FT /note= "There is Lys at this position in the sequence

FT shown in AAY71310"

FT Modified-site 484

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 488

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 502

FT /note= "Casein kinase II site"

FT Inhibitory-site 542..722

FT Modified-site 576

FT /note= "Casein kinase II site"

FT Peptide 623..640

FT /note= "used as immunogen to generate antibody AS 472"

FT Modified-site 626

FT /note= "Protein kinase C (PKC) site"

FT Misc-difference 661

FT /note= "There is Asn at this position in the sequence

FT shown in AAY71310"

FT Modified-site 694..696

FT /note= "Asn is N-glycosylated"

FT Modified-site 715

FT /note= "Casein kinase II site"

FT Peptide 762..1163

FT /note= "used as immunogen to generate antibody AS Bruna"

FT Modified-site 784

FT /note= "Protein kinase C (PKC) site"

FT Misc-difference 820

FT /note= "There is Leu at this position in the sequence

FT shown in AAY71310"

FT Modified-site 821

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 850

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 855

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 863

FT /note= "Casein kinase II site"

FT Modified-site 868

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 893

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 912..914

FT /note= "Asn is N-glycosylated"

FT Modified-site 925..927

FT /note= "Asn is N-glycosylated"

FT Modified-site 954

FT /note= "PKC and casein kinase II sites"

FT Modified-site 956

FT /note= "PKC and casein kinase II sites"

FT Region 975..1162

FT /note= "This region is not essential for inhibitory

FT activity"

FT Region 976..1163

FT /note= "C-terminal common region found in Nogo A, B and C

FT isoforms"

FT Domain 988..1023

FT /label= Transmembrane domain

FT /note= "C-terminal hydrophobic region"

FT Modified-site 1024

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 1071..1073

FT /note= "Asn is N-glycosylated"

FT Modified-site 1073

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 1089

FT /note= "Protein kinase C (PKC) site"

FT Domain 1090..1125

FT /label= Transmembrane domain

FT /note= "C-terminal hydrophobic region"

FT Modified-site 1141..1143

FT /note= "Asn is N-glycosylated"

FT Modified-site 1143

FT /note= "Protein kinase C (PKC) site"




XX 










PN WO200031235-A2.
XX

PD 02-JUN-2000.
XX

PF 05-NOV-1999; 99WO-US026160.
XX











PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB ME. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons.

XX 

PS Claim 3; Fig 13; 122pp; English. 
XX 

CC The present sequence is an alternative version of rat Nogo A protein 

CC which is a potent neural cell growth inhibitor and is free of all central 

CC nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases.

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. Note: The present 

CC sequence is an alternative version of the Nogo A sequence shown in Fig. 

CC 2A (see AAY71310). SEQ ID numbers 35-42 are referred in claim 32 and SEQ

CC ID NO: 29 in disclosure of the specification. However the specification 

CC does not include sequences for these SEQ ID numbers 

XX 

SQ Sequence 1163 AA; 

Query Match 99.6%; Score 5823; DB 3; Length 1163; 

Best Local Similarity 99.7%; Pred. No. 7.9e-296; 

Matches 1159; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
       |||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASXPSLSPLSTVSFKEHGYL 240

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
       |||||||||||||||||||||||||||||||||||||||||||||||| |||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENXTDEKKIEERKA 480

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 MAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLSSSKEDKIKESETFSDSSPIE 840

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163



RESULT 5 
AAY71560 

ID AAY71560 standard; protein; 974 AA. 
XX 

AC AAY71560; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A protein fragment used in the construction of mutant NiAext. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 



CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a fragment of rat Nogo A protein shown in AAY71310, which is used in 

CC the construction of mutant NiAext. The mutant is composed of His-tag/T7-

CC tag/vector/Nogo-A sequence aa 1-974/T7-tag. Nogo A deletion mutants were

CC used for mapping the inhibitory sites of Nogo protein. Major inhibitory 

CC region was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 

CC numbers 
XX 

SQ Sequence 974 AA; 

Query Match 84.1%; Score 4921; DB 3; Length 974;

Best Local Similarity 99.9%; Pred. No. 8.8e-249; 

Matches 973; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
       |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 420

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy 961 RSLSAVLSAELSKT 974
       ||||||||||||||
Db 961 RSLSAVLSAELSKT 974





RESULT 6 
AAU04591 

ID AAU04591 standard; protein; 1192 AA. 
XX 

AC AAU04591; 
XX 

DT 26-SEP-2001 (first entry) 
XX 

DE Human Nogo protein. 
XX 



KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein; 

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 

KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis;

KW Marchiafava-Bignami disease; pontine myelinolysis; adrenoleukodystrophy;

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 



KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe's disease.

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Domain 1054..1119

FT /label= Lumenal_extracellular_domain

FT /note= "This sequence is specifically claimed"

FT Peptide 1055..1094

FT /label= Pep1

FT /note= "Receptor binding inhibitory peptide. This

FT sequence is specifically claimed"

FT Peptide 1064..1088

FT /label= Pep2

FT /note= "Receptor binding inhibitory peptide. This

FT sequence is specifically claimed"

FT Peptide 1074..1098

FT /label= Pep3

FT /note= "Receptor binding inhibitory peptide. This

FT sequence is specifically claimed"

FT Peptide 1084..1108

FT /label= Pep4

FT /note= "Receptor binding inhibitory peptide. This

FT sequence is specifically claimed"

FT Peptide 1095..1119

FT /label= Pep5

FT /note= "Receptor binding inhibitory peptide. This

FT sequence is specifically claimed"

XX 

PN WO200151520-A2. 
XX 

PD 19-JUL-2001. 
XX 

PF 12-JAN-2001; 2001WO-US001041.
XX 

PR 12-JAN-2000; 2000US-0175707P . 

PR 26-MAY-2000; 2000US-0207366P . 

PR 29-SEP-2000; 2000US-0236378P . 
XX 

PA (UYYA ) UNIV YALE . 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR N-PSDB; AAS09453. 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders. 
XX 

PS Example 1; Page 101-104; 109pp; English. 
XX 

CC The sequence is the human Nogo protein, a 250kDa myelin-associated axon 

CC growth inhibitor. The invention relates to the use of the nogo receptor, 

CC nogo protein, their nucleic acids, vectors expressing them and antibodies 

CC against them, to isolate agents which block nogo receptor mediated axonal 



CC growth. The agent is useful for treating a central nervous system 

CC disorder which is a result of cranial or cerebral trauma, spinal cord 

CC injury, stroke or a demyelinating disease selected from multiple 

CC sclerosis, monophasis demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, Marchiafava-Bignami disease,

CC pontine myelinolysis, adrenoleukodystrophy, Pelizaeus-Merzbacher disease,

CC Spongy degeneration, Alexander's disease, Canavan's disease,

CC metachromatic leukodystrophy, viral infection and Krabbe's disease

XX 

SQ Sequence 1192 AA; 

Query Match 75.3%; Score 4403.5; DB 4; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1.3e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III 111:11 : I I I I I I I I II I I I II I I I I I I II : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I II II I I I II I I I I I I I I I I I I I I II I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I 1 I I I I I I I I I I I I I I I I M III I I II I I : I : I I I I I I I I I I II I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

II I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I | | | | | | : : | | : I I : I I : : I I : I : I I : I I : I I I III 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I II I I I I II : : : I I I : I I I I I I I II M : I II I I : I I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

11111:11 Mill: I I I I I : I I I I I I I I : I I I I I I I I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I II I I I I I I : I I I I : II I II I I I II I I II I I I I I I I I I I I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573

III: I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I II : I I I I I I III 
Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596



Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633
I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I : M I I I II I I : I : I I I I I I I 



Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

M II I I I I : I : I I : I II Mill: I I I : I I I I I I I I I I I I I I I I M M I I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I : I i I : I I I : I I .: I : II I I I I I I : I I I I I I I I I I I I I : I II hi I I I : I I I I I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I I I I I I I I I I I : : I : I I I : : : I I - I I I I I I I I I I 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: I I : I I I I II III : I : I : I I I I I I I I I I I I I I I I I : I : I II I I I : I I I II I I I 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I I I I I I I I I I I I I I I I I I : : : I I : II : I I : I I : I : I 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

III I I : I I M I I I I I I I I I I I I I I I I I I I II: I I I I I I I I I I II I I II I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I I I I i I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 7 
ABP68600 

ID ABP68600 standard; protein; 1192 AA. 
XX 

AC ABP68600; 
XX 

DT 14-JAN-2003 (first entry) 
XX 

DE Human pancreatic cancer expressed protein SEQ ID NO 71. 
XX 

KW Human; pancreas; cancer; gene therapy; vaccine; immunostimulant;

KW cytostatic; tumour. 

XX 

OS Homo sapiens. 
XX 

PN WO200260317-A2. 
XX 

PD 08-AUG-2002.
XX

PF 30-JAN-2002; 2002WO-US002781.
XX 

PR 30-JAN-2001; 2001US-0265305P . 

PR 31-JAN-2001; 2001US-0265682P . 

PR 09-FEB-2001; 2001US-0267568P . 

PR 21-MAR-2001; 2001US-0278651P.

PR 28-APR-2001; 2001US-0287112P . 

PR 16-MAY-2001; 2001US-0291631P . 

PR 12-JUL-2001; 2001US-0305484P. 

PR 20-AUG-2001; 2001US-0313999P . 

PR 27-NOV-2001; 2001US-0333626P.
XX 

PA (CORI-) CORIXA CORP. 
XX 

PI Benson DR, Kalos MD, Lodes MJ, Persing DH, Hepler WT, Jiang Y; 
XX 

DR WPI; 2002-627435/67. 

DR N-PSDB; ABV94680. 
XX 

PT New isolated polynucleotide and pancreatic tumor polypeptides, useful for 

PT diagnosing, preventing and/or treating cancer, particularly pancreatic 

PT cancer. 
XX 

PS Claim 2; SEQ ID NO 71; 300pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated polynucleotide (I) comprising: (a) 

CC any of a group of over 4000 nucleotide sequences (ABV94628-ABV99145); (b)

CC complements of (a); (c) sequences consisting of at least 20 contiguous

CC residues of (a); (d) sequences that hybridize to (a), under moderately

CC stringent conditions; (e) sequences having at least 75% or 90% identity 

CC to (a); or (f) degenerate variants of (a). Polypeptides (ABP68596- 

CC ABP68637) encoded by (I) and oligonucleotide can be used to detect cancer 

CC in a patient and compositions comprising polypeptides, polynucleotides, 

CC antibodies, fusion proteins, T cell populations and antigen presenting 

CC cells expressing the polypeptide are useful in treating pancreatic cancer 

CC and stimulating an immune response. The polynucleotides can be used as 

CC probes or primers for nucleic acid hybridisation, in the design and 

CC preparation of ribozyme molecules for inhibiting expression of the tumour 

CC polypeptides and proteins in the tumour cells, in vaccines and for gene 

CC therapy. Note: The sequence data for this patent did not form part of the

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 
XX 

SQ Sequence 1192 AA; 

Query Match 75.3%; Score 4403.5; DB 5; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1.3e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 Mill 11111111111111 I I I I I I I I II I : I I :: I I I I I I I I M I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III 111:11 : I I I I I I I I I I I I I I I II I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II II I I I I I I I I I I I II I I I I I I I I III I I I I I I : I : I I I I I I II I I I I I I I I I I 
Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

Mill I I I I I I I I I II I : I I I : : I :: I I I I I : I : I :: I I I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I II I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I II I 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

III II I III : : : I I I : I I I I I I I I II I : Mill: I I Mill : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

M M I : II Mill: I II I I : I I II I I I I : M I II II I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476

455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I II I I I I M II I M M I I : II M : II I II II I I I I I I I II II I I II I I I : 

477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM I : I I I I II M II I I II I II II II II I II I II I I : I I M I II : I I I II I I II 

537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

M I I I I : I I II I I II I I I II I II I II : I II I II I : M I I I II I I : I : I II I I I I 

597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I II II I : I : I I : I II Mill: II I : I I I II I I I II II II I I I II II 11 = 111 
656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : II I : II I : I I : I : I M I I I II : II I I II II II I I I : II I I : I" I I I : I I II I I I 
716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

I : : : : : : I : II I I I II I I I I II : : I MM : : : 11 = 11111 Mill 

776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: I I : I I II II III : I : I : I I I I II I I II I I I I I M MM M I II M II II I II 

836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

MIMI M II II II I I M I I I I I : : : I I : II : I I : M : I : I 

896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 



Qy  927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
        |||  | |: |||| | | ||||||||||||||||| ||: |||||||||||||||||||
Db  956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 8 
ABR59667 

ID ABR59667 standard; protein; 1192 AA. 
XX 

AC ABR59667; 
XX 

DT 22-JUL-2003 (first entry) 
XX 

DE Human NogoA protein. 
XX 

KW Human; Nogo receptor; NgR; CTS domain; neuroprotective; gene therapy; 

KW axonal growth; central nervous system; CNS; Nogo; spinal cord injury; 

KW cranial trauma; cerebral trauma; spinal trauma; stroke; Krabbe's disease;

KW demyelinating disease; multiple sclerosis; monophasic demyelination; 

KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis. 

XX 

OS Homo sapiens. 
XX 

PN WO2003031462-A2. 
XX 

PD 17-APR-2003. 
XX 

PF 04-OCT-2002; 2002WO-US032007.
XX

PR 06-OCT-2001; 2001US-00972599.
XX 

PA (UYYA ) UNIV YALE.
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2003-393433/37. 

DR N-PSDB; ACC81048. 
XX 

PT New human Nogo receptor polypeptides and nucleic acids, useful for 

PT decreasing inhibition of axonal growth by a central nervous system 

PT neuron, or in treating central nervous system disease, disorder or 

PT injury, e.g. spinal cord injury. 
XX 

PS Disclosure; Page 131-135; 148pp; English. 
XX 



CC The invention relates to a novel nucleic acid encoding a polypeptide 

CC comprising amino acid residues 27-309 of a 473 amino acid sequence (PI, 

CC human Nogo receptor (NgR) NTLRRCT domain), or residues 27-309 of PI with

CC 1-20 conservative amino acid substitutions, and less than a complete CTS 

CC domain, provided that a partial CTS domain, if present, consists of no 

CC more than the first 39 consecutive residues. The nucleic acid of the 

CC invention has neuroprotective activity. The polynucleotide may have a use 

CC in gene therapy. The nucleic acid is useful for decreasing inhibition of 

CC axonal growth by a central nervous system (CNS) neuron. The NgR 

CC polypeptide or an agent that inhibits the binding of Nogo to NgR or NgR-

CC dependent signal transduction in the central nervous system neuron may be

CC used in treating central nervous system disease, disorder or injury, e.g. 

CC spinal cord injury. Expression of an NgR protein may be associated with 

CC inhibition of axonal regeneration following cranial, cerebral or spinal 

CC trauma, stroke or a demyelinating disease, such as multiple sclerosis, 

CC monophasic demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, or Krabbe's disease. The present

CC sequence is used in the exemplification of the invention 

XX 

SQ Sequence 1192 AA; 

Query Match 75.3%; Score 4403.5; DB 6; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1.3e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:11111111 I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I I I ! I I I : I I : I I II I I I I I II I I I I I I I I I I I I : I I : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

M II I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I | II | | | | | | | | || | | 
Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I I I I I I I I I I I I : I 1.1 : : I : : I I I I I : I : I :: I I I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I :: I I : II : I I : : I I : I : I I : I I : I I I III 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I I I ::: I I I : I I I I I I I II I I : I I I I I : I I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

I I I I I : I I I I I I I : I I I I I : I I I I I I I I : I I I I I I I I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476



Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I I I I II I I : I I M : I I I II I I I I I I I I I I I I I I I I I I I I : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I I I I I I I I I I I I II I I I I I I I I I I I I I I II I : I I I I I I I : I I I I I I III 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I I I I : I I I I M I I I I I M I I I I I I : I I I I I I I : I I I MM! I : I : I I I I I I I 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

I I I I I I I I : I: I I : I II Mill: I II : I I I I I I I I I I M I I I I I I I I I I I : I II 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I II; II I : I I : I : I M I I I I I : II I I M I I II I II : M I Ml I I I : I I I I I I I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

M : :: : : | : || | I I I I I I t i J i = : I MM :: : M M I II I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

M I : I I M II Ml : I : I : II I I I I I II I M I II II MM II II I M II II I I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

II I I I I I I I I II II I I II I I I II : : : II M I M I : I I : I M 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy  927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
        |||  | |: |||| | | ||||||||||||||||| ||: |||||||||||||||||||
Db  956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 9 
AAY56967 

ID AAY56967 standard; protein; 1192 AA. 
XX 

AC AAY56967; 
XX 

DT 25-APR-2000 (first entry) 
XX 



DE Human MAGI polypeptide. 
XX 

KW MAGI protein; neuroendocrine-specific protein; neuropathy; human;

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective; neuroprotective. 

XX 

OS Homo sapiens.
XX 

PN WO200005364-A1. 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB002360.
XX

PR 22-JUL-1998; 98GB-00016024.

PR 19-JUL-1999; 99GB-00016898.
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR N-PSDB; AAZ56886. 
XX 

PT Novel polypeptides related to neuroendocrine-specific proteins and

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders. 
XX 

PS Claim 2; Page 20-21; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specific protein. The MAGI protein can be expressed by

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleotide is also useful for chromosome

CC localization and for tissue expression studies. The present sequence 

CC represents the human MAGI protein 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 3; Length 1192; 
Best Local Similarity 75.9%; Pred. No. 2.3e-221;
Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I : I I :: I I I I I II I I I I I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy  61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
       |||||||| ||  ||| |||:|| :| |||||||||||||| ||||||||: || :   |
Db  59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
       |||   |||| |||||||||||||||||||| || ||         |||||||||||||
Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I : I : I I I I I I Ml I M I I I I I I 
179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I I I I I I I I I I I I : I I I : : I : : I I II I : I : I : : I I I I I I II I I 

238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I III 
298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I II : : : I I I : I I I II I I I I I I : I I I I I : I I I : I I I : I : 

358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

11111:11 Mill: I I II Ml II II I II Ml II I I II I Mill 
417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I M I I I I M M I I I I I I I I : I I I I : I I I M I I I I II I I I MM II II I I I : 

477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM I M I I I I I I II I M I II I I I I M I I I I II I I II M I I II I I M I I II I III 

537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 
574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I II I I I M II I I I I I II I I I I II I I I M II II I I M I I II I I I I M M II I II I 

597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692
I M I I I I I M M I : I II Mill: I I I : I I I I I I I I I II I M I M I II I II M II 

656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I M I I : I I I : I I : I : I I I I II I I : II I M I M M I II M II Ml I II : II I I I I I 
716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

M : : : : : I : I I I I I II I I I II I : : I : I II : : : I I M II I I II I M 

776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

M I : I I I I I I III : I : I M I I I M I I II II II I I I MM II II I M II I I I I I 
836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

II M I I M MM Ml Ml Ml M ::: I I : I I : I |: || : | :| 

896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

Ml I M I I I I I I I I I I I I I I I I I I I I I I I I I : I If I I I I I I I I I II I I I I I 
956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 



Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 10 
AAB82349 

ID AAB82349 standard; protein; 1192 AA. 
XX 

AC AAB82349; 
XX 

DT 23-JUL-2001 (first entry) 
XX 



DE Human NOGO-A protein. 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; brain injury; 

KW stroke; neuronal degeneration; Alzheimer's disease; Parkinson's disease; 

KW neuromuscular disorder; psychiatric disorder; developmental disorder; 

KW neuroprotective; nootropic; neuroleptic; antiparkinsonian; 

KW cerebroprotective; neuroleptic; diagnosis; therapy. 
XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2000WO-GB004345.
XX

PR 15-NOV-1999; 99GB-00026995.

PR 24-JAN-2000; 2000GB-00001550.
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR N-PSDB; AAF90324. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases. 
XX 

PS Disclosure; Page 26-27; 25pp; English. 
XX 

CC The present sequence is that of human NOGO-A. NOGO-A is a previously 

CC known splice variant of the human NOGO gene on chromosome 2p21. The 

CC invention relates to a novel splice variant, NOGO-C (see AAB82348). It



CC provides NOGO-C polypeptides and polynucleotides, and methods for 

CC producing such polypeptides by recombinant techniques. Also disclosed are 

CC methods for utilising NOGO-C polypeptides and polynucleotides in the 

CC treatment of diseases including neuropathies, spinal injury, brain 

CC injury, stroke, neuronal degeneration, for example Alzheimer's disease 

CC and Parkinson's disease, neuromuscular disorders, psychiatric disorders 

CC and developmental disorders. Also provided are methods for identifying 

CC agonists and antagonists for use in treating conditions associated with NOGO

CC -C imbalance, and diagnostic assays for detecting diseases associated 

CC with inappropriate NOGO-C activity or levels 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 4; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 2.3e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy  61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
       |||||||| ||  ||| |||:|| :| |||||||||||||| ||||||||: || :   |
Db  59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285
       |||||  ||||| |||||| |  :|||::| ::|||||: |:|    ::||| |||||||
Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339
       |||||||  ||| |||::| | :||:||::||:|: || :  ||: || |       |||
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395
        ||| ||  | ||| :::| ||:|||||||||||: |||||: | | |:|||     :|:
Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454
       |||||:||  ||||| :  ||||  |:| ||||||| :||   |||||| |  :||||
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476



Qy 514 573

Db 537 596


Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I I I I : I II I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I I : I : I I I I I I I 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I I I : h I I : I II Mill: I I I : I I I I I I I I I I I I I I I I I I I I I I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I : I I I : I I I : II : I : I I I I I I I I : I I I I I I I I I I I I I : I I I I : I I I I : I I I I I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

I : : : : : : I : I I I I I I I I I I I I I : : I : I I I : : : I I : I I I I I I I I I I 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: I I : I I I I I I III : I : I : I I I I I I I I I I I I I II I I : I : I II I I I : I I I I I I I I 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926 

I I I I I I I I I I I I I I I I I I I I I I I : : : I I : I I : I I : I I : I : I 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy  927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
        |||  | |: |||| | | ||||||||||||||||| ||: |||||||||||||||||||
Db  956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 11 
ABG30938 

ID ABG30938 standard; protein; 1192 AA. 
XX 

AC ABG30938; 
XX 

DT 21-OCT-2002 (first entry) 
XX 

DE Human NogoA protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 

KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 

KW neuroblastoma; hyperproliferative disorder; dysproliferative disorder;

KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer;

KW tissue hypertrophy; central nervous system; axon regeneration; NogoA; 

KW Nogo-associated disease; metastasis.
XX



OS Homo sapiens.
XX 

PN WO200257483-A2. 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB000228.
XX

PR 18-JAN-2001; 2001GB-00001312.
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A;
XX 

DR WPI; 2002-599722/64. 

DR N-PSDB; ABK90134. 
XX 

PT Identifying modulators of Nogo or BACE activity for treating acute

PT neuronal injuries, neoplastic or dysproliferative disorders, comprises

PT providing and monitoring interaction between Nogo and BACE polypeptides. 
XX 

PS Disclosure; Page 59-62; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperproliferative or dysproliferative disorders (e.g.

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present amino acid sequence represents the human NogoA 

CC protein of the invention 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 5; Length 1192;
Best Local Similarity 75.9%; Pred. No. 2.3e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I : I I :: I I I I I I I II I I I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 



Qy  61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
       |||||||| ||  ||| |||:|| :| |||||||||||||| ||||||||: || :   |
Db  59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

I II I I I I I I I I I I I I I I I I I I I I I I I I II || I I I I I I I I I I I I I 

119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II I I I I I I I I I I I I I I Ml I I MM III II I II I : I : I I II I I I I I I I I I I II I I 

179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

I II I I I I I I I I II I I I I : I || :: | :: | | | | | : | : | :: I M II I I I I I 

238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I II I I I III M I : : I I : | | : M : : I I : I : II : I I : II I I I I 

298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : I : 

358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454
       |||||:||  ||||| :  ||||  |:| ||||||| :||   |||||| |  :||||
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476

455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I II I II I II I I II I I II M II M I I I II II II II I I I MM II I I I I I : 
477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I II I : I I II I I I I I I II I I I M I I II I II I I II I II M I II II I M I I I I I II I 

537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

II I M I M II I I I I I II II I II I I I I M I I I I II M I I Mill I : I : I I I II II 

597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I M I I I I I : I M I : I II Mill: I II : I I I I II I I II I II I I II I I II II : I II 
656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I M I I : I I I : I I : I : II I M I I I : I I I I I I I I I I I II : II I Ml I II : I II II I I 
716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : =1 = 111 I I I I I I I I I I : : I MM : : : I I : I II II I II I I 
776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: I I : I I I I II I II : I : I : I II I I I I I M II II I I I : I : I II I I I : I I II I II I 
836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 



Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

MINI I I I I I I I I I I I I I I I I I : : : I I : I I : I I : II : I : I 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy  927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
        |||  | |: |||| | | ||||||||||||||||| ||: |||||||||||||||||||
Db  956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I : I II I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 
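
The summary lines for RESULT 11 agree with one another arithmetically, which gives a quick check on the scanned figures. The sketch below assumes, since this report does not state it, that Best Local Similarity is the share of identical positions among all alignment columns, with columns taken as Matches + Conservative + Mismatches + Indels. The function name local_similarity is hypothetical and is not part of the search software.

```python
def local_similarity(matches: int, conservative: int, mismatches: int, indels: int) -> float:
    """Percent of alignment columns holding identical residues (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from the RESULT 11 summary line.
columns = 908 + 104 + 146 + 39
print(columns)                                         # 1197
print(round(local_similarity(908, 104, 146, 39), 1))   # 75.9
```

The column total also matches the alignment layout: 19 full rows of 60 columns plus a final row of 57 (Db 1136 to 1192) give 1197.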



RESULT 12 
ABB81078 

ID ABB81078 standard; protein; 1192 AA. 
XX 

AC ABB81078; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological; analgesic; hepatotropic;

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; human; receptor. 
XX 

OS Homo sapiens. 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-00893348. 
XX 

PR 19-MAY-1998; 98IL-00124500. 

PR 21-JUL-1998; 98WO-US014715. 

PR 22-DEC-1998; 98US-00218277. 

PR 19-MAY-1999; 99US-00314161. 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 



DR N-PSDB; ABN86601. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides. 

XX 

PS Example; Page 53-56; 93pp; English. 
XX 


CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS). The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates), Charcot-Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Refsum's disease, Fabry's 

CC disease, or lipoproteinemia. The present sequence represents the human 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 5; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 2.3e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA— P 115 

I I I I I I I I II III llhll : I I I I I I I I I I I I I I I I I I I I I I I : II : I 
Db 59 P AAGL S AAP VPT AP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP ERQPSWDPS P VS S T VP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 17 8 



Qy 167 RG- S GS VDETL FAL PAAS E PVI P S S AEKIMDLMEQPGNTVS S GQEDFP S VLLETAAS LPS 225 . 

II II I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : II I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 2 86 S EMGS S FKG S P KGE S AI LVENT KEEVI VRS KDKED- LVC S AALH S PQE S P VG KED 339 

I I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I III 

Db 298 SEMGSSFSVSPKAESAVIV7\NPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

I I I I I I I I I ::: I I I : I II I I I I I I I I : II I I I : I I I : II I : I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

11111:11 Mill: I I I I I : I I I I I I I I : I I I I I I I I I : I I I I 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I II I I I M : II I I : I I I II I I I II I I I I I I II MINI I : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVT EAAVS NMP E GLT P DLVQEAC E S E LN EAT GT K I AYET KVDLVQT S EAI Q E S L Y PT AQL 573 

I I I I I : I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I : II I I I I I : I I I II I III 

Db 537 KVT E E WANMP E GLT P D LVQEACE S E LN EVT GT K I AYET KMD LVQT S EVMQ E S L Y PAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I II I : I I II I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I I : I : I I I I I I I 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSWYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I I I : I : II : I II Mill: I I I : I I I I I I I I I I I I I II I II I I I II : I I I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLS7VEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I I : I I I : II : M I II I II I I : I I I I II I I II I II : I I I Ml I I I : I I I II I I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : M I I I I I I I II I I I M : I MM : : : I I : II II I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: II : I I I I I I 111 : I : I : I I I I I I II II I I I I I I I MM M I II M I I II I I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVS KASISPSNV 926 

I I I II I I I I I II I I I I I I I II II : : : I I M I : I M II : I : I 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

I M I I : I I I I I I M I I I I I I II II I I II I II: I I I I M I I I I II I I I II I I 

Db 956 SALATQAEI ES I VKPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELS KTS WDLL YWRDI K 1015 

Qy 987 KTGVVFGASL'FLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRA. 1046 



I I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 1016 KTGVVFGAS LFLLLS LTVFS I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGH P FRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I I I I I I I I I I I I I I I I I M I I I I M II I I I I I I I I I I I I I I I I II I II I I I I I I M I I I 

Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLI LALI S LFS I PVI YERHQVQI DHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

I I I I M I I I II II II : I I I I I II I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : 

Db. 1136 NGLTLLI IALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 

RESULT 13 
AAY71311 

ID AAY71311 standard; protein; 1178 AA. 
XX 

AC AAY71311; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Human neurite growth inhibitor Nogo. 
XX 

KW Human; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Misc-difference 187 

FT /label= Unknown 

FT Misc-difference 188 

FT /label= Unknown 

FT Misc-difference 189 

FT /label= Unknown 

FT Misc-difference 190 

FT /label= Unknown 

FT Misc-difference 221 

FT /label= Unknown 

FT Misc-difference 328 

FT /label= Unknown 

FT Misc-difference 477 

FT /label= Unknown 

FT Region 977..1012 

FT /note= "Region specifically described in claim 16" 

FT Region 994..1174 

FT /note= "Region specifically described in claim 16" 

FT Region 1079..1114 

FT /note= "Region specifically described in claim 16" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 



PF 05-NOV-1999; 99WO-US026160. 
XX 

PR 06-NOV-1998; 98US-0107446P. 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 11; Fig 13; 122pp; English. 
XX 

CC The present sequence is a human Nogo protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The human Nogo 

CC sequence was derived by aligning human expressed sequence tags (ESTs) 

CC e.g. AA158636, AA333267, AA081783, AA167765, AA322918, AA092565, AA081525 

CC and AA081840 with the rat Nogo sequence. Nogo proteins and fragments 

CC displaying neurite growth inhibitory activity are used in the treatment 

CC of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer's and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperproliferative or benign 

CC dysproliferative disorders e.g. psoriasis and tissue hypertrophy. 

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1178 AA; 



Query Match 73.1%; Score 4276.5; DB 3; Length 1178; 

Best Local Similarity 73.8%; Pred. No. 5.5e-215; 

Matches 883; Conservative 104; Mismatches 158; Indels 51; Gaps 19; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : I I I I I I I I II III I I I I I I II I I I I I I I I I II : II :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSVPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

QY 61 PAAGLSAAAVP — P AAAAP L L D F S S D S VP PAP RG P L P AAP P AAP E RQ P S WE RS P AA- — P 115 

M I I I I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I I II I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 



Qy  116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166 

Db  119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

I I I : I I I I I I II I I I : I : I II I I I II I I HI I I II I 

179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I I I I M I 

225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALH5PQESPVG- KEDR 34 0 

I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : II III I I I 

285 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : | : | 

345 WS S EKAKDS FNEKRVAVEAPMREE YADFKP FERVWEVKDS KEDS - DMLAAGGKI ESNLE 4 03 

397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

I I I I : I I Mill: I I I I I : I I I I I I I I : I I I I I II I I I : I I II 
4 04 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 4 63 

456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

I I I I I I I I I I I I I I I I I I I : I I I I : I I I II I I I II I II III MINI I : I 
4 64 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSCTDYVTTDNLTK 523 

515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I : I I I I I I I =111111 I I I I 
524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

575 P S FEEAEAT P S P VL PD I VMEAP LN S LL P S AGAS WQ P S VS P LEAP P P VS YDS I KLE P EN P 634 

I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I I : I : I I I I II I I 
584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

M I I I I I : I : II I II Mill: I I I : I I I I I I I I I II I I I I I II I I I I I : I I I I : 
643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

IIMM I : I I : I : I I I I I I I I : I I II I I II I I II I : II I Ml I I I : I I I I I I I I 
703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 
: : : : : : I : I I I I I MINIM: M MM : : : I I : I I I M I I I I I : I 

7 63 SMIEYENKEKLSALPPEGGKPYLESFKLSLVNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 87 0 

I M I I I I I III : I : I : I! I I I I I I I I I II I I I I MM II I I I : M I M I I I I 
823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 8 82 

871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 
Mill II I II I I II M I I I I I I : : : I I : I I M I : I I : I : II I 

8 83 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

92 9 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 
I I I : MM I I I I M I I I I I I I I II M I II: MM I I I I I M I I I I I I II 



Db  943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKK 1002 

Qy  988 TGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1003 TGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

Qy 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 
        |||||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db 1063 LESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

Qy 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 
        ||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 





RESULT 14 
AAU33228 



ID AAU33228 standard; protein; 1246 AA. 
XX 

AC AAU33228; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Novel human secreted protein #3719. 
XX 

KW Human; vaccination; gene therapy; nutritional supplement; 

KW stem cell proliferation; haematopoiesis; nerve tissue regeneration; 

KW immune suppression; immune stimulation; anti-inflammatory; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200179449-A2. 
XX 

PD 25-OCT-2001. 
XX 

PF 16-APR-2001; 2001WO-US008656. 
XX 

PR 18-APR-2000; 2000US-00552929. 

PR 26-JAN-2001; 2001US-00770160. 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-611725/70. 
XX 

PT Nucleic acids encoding a range of human polypeptides, useful in genetic 

PT vaccination, testing and therapy. 

XX 

PS Claim 20; Page 737; 765pp; English. 
XX 

CC The invention relates to novel human secreted polypeptides. The 

CC polypeptides and antibodies to the polypeptides are useful for 

CC determining the presence of or predisposition to a disease associated 

CC with altered levels of polypeptide. The polypeptides are also useful for 

CC identifying agents (agonists and antagonists) that bind to them. Cells 

CC expressing the proteins are useful for identifying a therapeutic agent 



CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising the 

CC nucleic acids encoding the polypeptides and cells genetically engineered 

CC to express them are also useful for producing the proteins. The proteins 

CC are useful in genetic vaccination, testing and therapy, and can be used 

CC as nutritional supplements. They may be used to increase stem cell 

CC proliferation; to regulate haematopoiesis; and in bone, cartilage, tendon 

CC and/or nerve tissue growth or regeneration; immune suppression and/or 

CC stimulation; as anti-inflammatory agents; and in treatment of leukaemias. 

CC AAU29510-AAU33304 represent the amino acid sequences of novel human 

CC secreted proteins of the invention 

XX 

SQ Sequence 1246 AA; 

Query Match 70.4%; Score 4116; DB 4; Length 1246; 

Best Local Similarity 72.0%; Pred. No. 1.4e-206; 

Matches 873; Conservative 112; Mismatches 171; Indels 56; Gaps 27; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111 = 111 I I I I I I I I I M I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I M M I 
Db 42 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 99 

Qy 61 PAAGL S AAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I I I I I I I I II III I I I : I I : I I I I I I I I I I II I I I I I I I I I I I : II : I 
Db 100 PAAGL SAAPVPTAPAAGAP LMD FGND FVP PAP RGP L PAAP PVAP ERQP S WD P S PVS S TVP 159 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAP P S T PAAP KR 166 

Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 

Db 160 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 219 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

I I I I I I I I I I I II I I I I I I I I I I I I III I I I II I : I : I I I I I I I I II I I I I I I I I 
Db 220 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 278 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I I I I I I I I I I I I : I I I : : I : : I I I II : I : I : : | | | I I I I I I I 

Db 279 LSPLST^SFKEHEYLGNLSTVXPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 338 

Qy 286 SEMGS S FKGS PKGESAILVENTKEEVI VRSKDKED-LVCSAALHS PQES PVG KED 339 

I I I I I M I I I I I I : : I I : | | : | | : : | | : | : | | : I I : II I Ml 
Db 339 SEMGS S FS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKED 398 

Qy 34 0 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RAW 395 

I M I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : | : 

Db 399 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 457 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

I M I I : I I I I I I I : I II I I : I I I I I I I I : I I I I I I I I I I : I II I 
Db 458 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIA 517 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I II I I I I I : I I I I : I I I || I II I I I I I I I | | | I I I I I I I : 
Db 518 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 577 

Qy - 514 KWEAAVSmPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

INI I : I I I I I M I I I I I I II I I I | I | || | | | | | | | | : | | | | | | | : I I I I | I Ml 
Db 57,8 KVTEEWANMPEGLT P DLVQEACES ELNEVTGTKI AYETKMDLVQT S EVMQE S LYPAAQL 637 



Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I II I : I I I I I I I I I I I I I I I I M I : I I I I II I : I II I I I I I I : I : I I I I I I I 
Db 638 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 696 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFN7\AVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I I I : I : II : III Mill: I I I : I I I I I I I I I I I I I I I I I I I I I I I : I I I 
Db 697 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 7.56 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I I : I I I : I I : I : I I I I I I I I : I I I I I I I I I I I I I : I I I I : I I I I : I I I I I I I 
Db 757 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 816 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I I I MINIM: : | MM : : : I I M I I I I I I I I I 

Db 817 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 876 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: I I : I I I I I I Ml : I : I : II I M I I M I M I I I I I MM I I I I I : M I I I I I I 
Db 877 STAVYSNDDLFI SKEAQI RETETFSDSS PIEI IDEFPTLI SSKTDSFSKLAREYTDLEVS 936 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNV 926 

I I I M I II I I II III Ml Ml II : : : II : I I : I M I I : I : I 
Db 937 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 996 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

I M I I : I I I I I I I I II I II I I I I I M II I II: M I I I I I I II II I II II I I 
Db 997 S7VLATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1056 

Qy 987 KTGWFGAS-LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFR 1045 

I I I I I I I I I : II I I I I M I II I M I II I I I I I II I I II M I I I I II I I I II I I I I I I I I 
Db 1057 KTGWFGASAVFLLLS LTVFS I VSVTAYI ALALLS VT I S FRI YKGVIQAI QKS DEGHP FR 1116 

Qy 104 6 AY LES EVAI S EELVQKYSNSALGHV-NST I KELRR L FLVD DLVD S L K- FAVLMW 1097 

I I | | : : | : M II I I I : I I : I I I I I II M : I I I I I I 

Db 1117 AISGNLESCLYLRELGSGRYSNSALGSMWNCTVKGNFRAPSFFSWMDLVDSLRSFAVLMW 1176 

Qy 1098 VFTYVGALFNGLTLL 1 LALI S L FS I PVI YERHQVQI DHYLGLANKS VKDAMAKI 1151 

II I I I I I I I : I I II : I I I I I I II I I I I I I II I : I I I II I I I 

Db 1177 VFTWGCL--GLMVXDTTGFWALNFISSSGSWLIYERHQAQIDHYLGIANKNVKDAMAKI 1234 

Qy 1152 QAKI PGLKRKAD 1163 

I II I I II II I I : 
Db 1235 QAKI PGLKRKAE 1246 



RESULT 15 
AAY71562 

ID AAY71562 standard; protein; 803 AA. 
XX 

AC AAY71562; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A protein fragment used in the construction of mutant NiG. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 



KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160. 
XX 

PR 06-NOV-1998; 98US-0107446P. 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids, useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a fragment of rat Nogo A protein shown in AAY71310, which is used in 

CC the construction of mutant NiG. The mutant is composed of His-tag/T7- 

CC tag/Nogo-A sequence aa 172-974/His-tag. Nogo A deletion mutants were used 

CC for mapping the inhibitory sites of Nogo protein. Major inhibitory region 

CC was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 



CC numbers 
XX 

SQ Sequence 803 AA; 



Query Match 68.8%; Score 4023; DB 3; Length 803; 

Best Local Similarity 99.9%; Pred. No. 6e-202; 

Matches 802; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 172 VDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLST 231 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1 VDET L FAL PAAS E P VI P S S AEK IMD LMEQ P GNT VS S GQED FP S VLL ET AAS LPSLSPLST 60 

Qy 232 VSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSS 291 

I I I I I I I I M M I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I II I I I I I I I I II I I I 
Db 61 VSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSS 120 

Qy 292 FKGSPKGESAILVENTKEEVIVRSKDKEDLVCST^ALHSPQESPVGKEDRVVSPEKTMDIF 351 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I II 

Db 121 FKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIF 180 

Qy 352 NEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVIJUVRANVESKVDRKCLEDSLEQK 411 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I : I I I I I I I 
Db 181 NEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLA7VR7^VESKVDRKCIEDSLEQK 240 

Qy 412 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTD 471 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 241 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTD 300 

Qy 472 EKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYWTDTLSKVTEAAVSNMPEGLTPDL 531 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 301 EKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDL 360 

Qy 532 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 591 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 420 

Qy 592 VMEAP LN S L L P S AGAS WQ P S VS P L EAP P P VS YD S I KL E P EN P P P YE EAMNVALKALGT K 651 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 421 VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 480 

Qy 652 EGIKEPESFN7\AVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 711 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I II II I I I I I 

Db 481 EGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 540 

Qy 712 VEDSS PES EPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS PQELG 771 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 541 VEDSS PESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS PQELG 600 

Qy 772 KPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESE 831 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 KPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESE 660 

Qy 832 TFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 891 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I M I I I I I I I I I I I I I I I 
Db 661 TFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 720 

Qy 8 92 LSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKK 951 



Db 721 L S FKNI YPKDEVHVS DE FS ENRS SVS KAS I S P SNVS ALEPQTEMGS IVKS KS LTKEAEKK 78 0 

Qy 952 LPSDTEKEDRSLSAVLSAELSKT 974 

I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 LPSDTEKEDRSLSAVLSAELSKT 8 03 



Search completed: September 3, 2004, 16:05:24 
Job time : 89.2518 secs 
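
The database records in the listing above use two-letter line codes (ID, AC, DT, DE, KW, OS, PN, ...) separated by XX lines. The following is a minimal sketch, not part of the search output, of how such a record could be grouped by line code for further checking. The function name parse_record and the sample text are illustrative assumptions; the sample is copied from RESULT 15 with the scan's blank lines removed.

```python
from collections import defaultdict


def parse_record(text):
    """Group the lines of one flat-file record by their two-letter line code.

    Hypothetical helper: continuation lines that repeat a code (for example
    several KW lines) are joined with a single space. XX separators, blank
    lines, and alignment lines (Qy/Db) are skipped.
    """
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line == "XX":
            continue
        code, _, rest = line.partition(" ")
        # Record line codes are exactly two upper-case letters.
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields[code].append(rest.strip())
    return {code: " ".join(parts) for code, parts in fields.items()}


sample = """ID AAY71562 standard; protein; 803 AA.
XX
AC AAY71562;
XX
DE Rat Nogo A protein fragment used in the construction of mutant NiG.
XX
KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS;
KW central nervous system; neoplastic disease; antiproliferative; glioma;
XX
OS Rattus sp.
"""

record = parse_record(sample)
keywords = [k.strip() for k in record["KW"].split(";") if k.strip()]
print(record["AC"], record["OS"], len(keywords))
```

Grouping by line code keeps multi-line fields such as KW and CC together, which makes it easier to compare the annotation of two hits from the same patent.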



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on:         September 3, 2004, 16:02:30 ; Search time 22.9601 Seconds 
                (without alignments) 
                2615.013 Million cell updates/sec 

Title:          US-09-830-972-2 
Perfect score:  5848 
Sequence:       1 MEDIDQSSLVSSSTDSPPRP...VKDAMAKIQAKIPGLKRKAD 1163 

Scoring table:  BLOSUM62 
                Gapop 10.0 , Gapext 0.5 

Searched:       389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters:      389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database :      Issued_Patents_AA:* 

                1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep:* 

                2: /cgn2_6/ptodata/2/iaa/5B_COMB.pep:* 

                3: /cgn2_6/ptodata/2/iaa/6A_COMB.pep:* 

                4: /cgn2_6/ptodata/2/iaa/6B_COMB.pep:* 

                5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:* 

                6: /cgn2_6/ptodata/2/iaa/backfiles1.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS 



RESULT 1 
US-08-700-607-1 

Sequence 1, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION : 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 199 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: 

CLONE: Consensus 
US-08-700-607-1 

Query Match 15.5%; Score 908; DB 2; Length 199; 

Best Local Similarity 96.3%; Pred. No. 1.1e-50; 

Matches 184; Conservative 3; Mismatches 4; Indels 0; Gaps 0; 
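
The two percentages reported for each hit can be reproduced from the counts printed beside them. The sketch below assumes that Query Match is the hit score divided by the perfect score (5848 in the run header), and that Best Local Similarity is the match count divided by the sum of matches, conservative substitutions, mismatches and indels. Both relations are inferred from the figures in this report rather than taken from GenCore documentation, and the function names are hypothetical.

```python
def query_match_pct(score, perfect_score):
    """Hit score as a percentage of the query's self-alignment score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# RESULT 1: Score 908; Matches 184; Conservative 3; Mismatches 4; Indels 0.
print(round(query_match_pct(908, 5848), 1))
print(round(best_local_similarity_pct(184, 3, 4, 0), 1))
```

The same relations also reproduce the 31.2% similarity reported for RESULT 2 (243 matches over 779 columns).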

Qy 973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 

f ,.„. J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db Q^KDKVVT)LLYWRDIKKTGvVFGASLFLLLSLTV 68 

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 128 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I 
Db 129 AVLMWVFT WGALFNGLTLLI LALI SLFSVP VI YERHQAQIDHYLGIANKNVKDT^MAKIQ 188 

Qy 1153 AKIPGLKRKAD 1163 

I I I I I I I I I I : 
Db 189 AKIPGLKRKAE 199 



RESULT 2 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 



APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 



TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE : Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY: U.S. 
; ZIP: 94304 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS : 

LENGTH: 776 amino acids 

TYPE: amino acid 
; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 307307 
US-08-700-607-5 

Query Match 13.5%; Score 789.5; DB 2; Length 776; 

Best Local Similarity 31.2%; Pred. No. 3.1e-42; 

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28; 

Qy 487 TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGT 546 

: I :| I I:: : : I : I | | :| : | 
Db 65 SGPARQSP — VAMET AS T GVAGVS S AMD HT FSTTSKDGEG SCYTSLI S 110 

Qy 547 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI — VMEAPLNS 599 

I I s I : : I | : : | | . | MINI: : I I 

Db 111 DICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGIESRGLFSSDSG 167 

Qy 600 — LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 

: I: I : "II: :| : :| : :: | |: 

Db 168 IEMTPTVESTEWKIIADPLDQMKT^iEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 227 

Qy 642 NVALKALGTKEGIKEPE SFNAAVQETEAPYISIACDLIKETKLSTE-PSP 690 

— = I I I : : I I : : : I I I I I I I : I : : : : I 

Db 228 DISIK PEGVREPDKPAPVEGKI I KDHLLEESTFAPYI D DLSEEQRRAPQITTP 280 



Qy 



691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSIPEV PQTQEEAVMLMK 745 



Db 281 VKITLTEIE PSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP 334 

Qy 746 ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 792 

I II I : I I : | :: | : | | : | | : : 

Db 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQIADRPEVKARSGPPT 394 

Qy 793 IPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKI 827 

II:- I I : : : I I : : I I : 

Db 395 I PSPLDHEASSAESGDSEI ELVSEDPM — AAEDALPSGYVSFGHVGGPPPSPASPSIQYS 452 

Qy 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSG 879 

:| I I I 1:11 || ::||| :: :: | | : 

Db 453 ILREEREAELDS ELIIESCDASSASEESPKREQDSPPMKPSALDAI REETGVRAEER 509 



Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

II II 11:111 I 11111:11 
Db 510 APSRRGLAEPG -- SFLD-YPSTEPQPGPEL PPGDGALEPETPM 549 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRD 984 

II I : : I : : : | : | : I I I I I I I 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 IKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1044 

M : I I : I I I : I II III I I : I I I I I : I I I II I I I I I I I I I : I I : I I : I I I I I I 
Db 598 IKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQAVQKTDEGHPF 657 

Qy 1045 RAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

: I I I I I : : I : I : II I : : : II I I : I I I I I I I I I I I I I I I I I II I I I : I I I I I 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I II I I : : I : : I : I : : I I : I : I | I M | | | | : : I I I I I I I I I I I I : 

Db 718 LFNGLTLLLMAVVSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQAKIPGAKRHAE 776 



RESULT 3 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 



OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 356 amino acids 
; TYPE: amino acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
; MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307309 
US-08-700-607-6 

Query Match 12.3%; Score 716.5; DB 2; Length 356; 

Best Local Similarity 46.1%; Pred. No. 4.6e-38; 

Matches 164; Conservative 48; Mismatches 89; Indels 55; Gaps 7; 

Qy 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSGADS 882 

: I I II hll I I : : I I I : : : : I I : I I 

Db 36 EEREAELDS ELIIESCDASSASEESPKREQDSPPMKPSALDAIREETGVRAEERAPS 92 

Qy 883 LPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSK 942 

II I I : I I I I I 1111:11 
Db 93 RRGLAEPG — SFLD-YPSTEPQPGPEL PPGDGALEPETPM 129 

Qy 943 SLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRDIKK 987 

II I : : I : : : I : I : I I I I I I I I I : 

Db 130 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRDIKQ 180 

Qy 988 TGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

I I : I I I : I II III I I : I II I I : I I I II I II I I I I I I : II : I I : I I I II I : I I 

Db 181 TGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAY 240 

Qy 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

II I : : I : I : I I I : : : M I I : I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I 

Db 241 LELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFN 300 

Qy 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I : : I : : I : I : : I I : I : I I III I I I I : : I I I I I I I I I II I .: 

Db 301 GLTLLLMAVVSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQAKIPGAKRHAE 356 



RESULT 4 
US-08-700-607-7 

; Sequence 7, Application US/08700607 
; Patent No. 5858708 



GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 208 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307311 
US-08-700-607-7 

Query Match 11.7%; Score 685; DB 2; Length 208; 

Best Local Similarity 67.0%; Pred. No. 2.2e-36; 

Matches 128; Conservative 32; Mismatches 31; Indels 0; Gaps 0; 

Qy 973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 

I : : I I I I I I I I I : I I : I I I : I II III I I : I I I I I : I I I I I I I I I I I I I I : 
Db 18 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVL 77 

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

I I : I I : I I I I I I : I I I I I : : I : I : I I I : : : I I I I : I I I I I I I I I I I I I I I I I 

Db 78 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 137 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152 

Mill: I I I I I I I I I I I I I : : I : : I : I : : I I : I : I I III I I I I : : I I I I 

Db 138 AVXMWLLTYVGALFNGLTLLLMAVVSMFTLPVVTVKHQAQIDQYLGLVRTHINAVVAKIQ 197 



Qy 1153 AKIPGLKRKAD 1163 

II I I I II I : 
Db 198 AKIPGAKRHAE 208 
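
Each Qy or Db row carries a start coordinate, a run of residues, and an end coordinate, so a row can be checked for transcription damage by comparing the residue count with the coordinate span. The sketch below is one minimal way to do that. The function name is hypothetical, and it assumes that gap characters and stray spaces do not consume a sequence position.

```python
def row_is_consistent(start, residues, end):
    """True if the letters in `residues` exactly fill positions start..end.

    Gap characters ('-') and whitespace are ignored, on the assumption
    that they do not advance the sequence coordinate.
    """
    letters = sum(1 for ch in residues if ch.isalpha())
    return end - start + 1 == letters


# Final rows of RESULT 4.
print(row_is_consistent(1153, "AKIPGLKRKAD", 1163))
print(row_is_consistent(198, "AKIPGAKRHAE", 208))
```

A row that fails this check has probably lost or gained characters, for example where a doubled V was read as a single W.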



RESULT 5 
US-08-700-607-8 

Sequence 8, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 267 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 281046 
US-08-700-607-8 



Query Match 11.5%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 2.5e-35; 

Matches 124; Conservative 33; Mismatches 30; Indels 0; Gaps 0; 
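
The three statistics lines that precede every alignment share a "label value;" layout, so they can be pulled into a dictionary with one regular expression. The sketch below is illustrative only: the helper name and the label list are assumptions based on the lines visible in this report, and it expects clean text, so labels or numbers broken by stray spaces will not match.

```python
import re

# Labels as they appear in the per-result statistics lines of this report.
STAT_RE = re.compile(
    r"(Query Match|Score|DB|Length|Best Local Similarity|Pred\. No\.|"
    r"Matches|Conservative|Mismatches|Indels|Gaps)\s+([0-9.eE+-]+)%?;"
)


def parse_stats(text):
    """Return {label: value} for every 'label value;' pair found in text."""
    return {label: float(value) for label, value in STAT_RE.findall(text)}


block = """Query Match 11.5%; Score 671; DB 2; Length 267;
Best Local Similarity 66.3%; Pred. No. 2.5e-35;
Matches 124; Conservative 33; Mismatches 30; Indels 0; Gaps 0;"""

stats = parse_stats(block)
print(stats["Score"], stats["Pred. No."], stats["Matches"])
```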

Qy 973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 

I : : I I I I I I II I : I I : I I I : I II III I I : I I I I I : I II II I I I I I I I I I : 
Db 9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVL 68 



Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

I I : I I : I I I I I I : I I I I I : : I : I : I I I : : : I I I I : II I I I I I I I I I II I I I I 
Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152 

Mill: I I I I I I I I I I I I I : : I : : I : I : : II : I : I I I : I I I I I : : I I I I 

Db 129 AVLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQVDQYLGLVRTHINTVVAKIQ 188 

Qy 1153 AKIPGLK 1159 

Mill: 
Db 189 AKIPGAR 195 



RESULT 6 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 

FILE REFERENCE: PZ002P1 
; CURRENT APPLICATION NUMBER: US/09/149,476 
; CURRENT FILING DATE: 1998-09-08 
; EARLIER APPLICATION NUMBER: PCT/US98/04493 
; EARLIER FILING DATE: 1998-03-06 
; EARLIER APPLICATION NUMBER: 60/040,162 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 

EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 
; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,615 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,502 

EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 
; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,617 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,618 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,503 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 



Query Match 9.2%; Score 539.5; DB 4; Length 168; 

Best Local Similarity 59.3%; Pred. No. 3.2e-27; 

Matches 99; Conservative 36; Mismatches 31; Indels 1; Gaps 1; 

Qy 998 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 1057 

: I I I I I I : : I I : I : I I I I I I I I I I II I I I I I I : I I I : I I I I I : I I I : : : : I I 
Db 1 MLLSLAAFSVISVVSYLILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSE 60 



Qy 1058 LVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 1117 

I I : I : I : I : I : I I M I : I I I I I I I II I I : M I I I : I M : I I I I II : 
Db 61 AFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAEL 120 



Qy 1118 SLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGL-KRKAD 1163 

: I I : I : : II : :: I I I I I : I : I I : I I I I I : I I : I : I I : 

Db 121 LIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 167 



RESULT 7 
US-08-700-607-3 

Sequence 3, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 241 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 31870 
US-08-700-607-3 

Query Match 8.9%; Score 519; DB 2; Length 241; 

Best Local Similarity 59.4%; Pred. No. 1.1e-25; 

Matches 95; Conservative 33; Mismatches 32; Indels 0; Gaps 0; 



Qy 972 SKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 1031 
I : I II: I I I : I I II III : I : I I I I I I : : I I : I : II I I I I I I I I I I I I I 

Db 44 SSCAVHDLIXWRDVKKTGFVFGTTLIMLLSLAAFSVISVVSYLILALLSVTISFRIYKSV 103 



Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091 

I I I : I I I M I I I I : I I I : : : : I | I I : I : I : I : I : I I I I I : I I 

Db 104 IQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLK 163 

Qy 1092 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQV 1131 

II II: I I I I I : I I I : I I I I I I : : I : | : : | : : : | 
Db 164 LAVFMWLMTYVGAVFNGITLLILAELLIXSVPIVYXKYKV 203 



RESULT 8 

US-08-714-741-32 

Sequence 32, Application US/08714741 
Patent No. 6500613 
GENERAL INFORMATION : 

APPLICANT: Briles, David E. 
APPLICANT: McDaniel, Larry S. 
APPLICANT: Swiatlo, Edwin 
APPLICANT: Yother, Janet 
APPLICANT: Crain, Marilyn J. 
APPLICANT: Hollingshead, Susan 
APPLICANT: Tart, Rebecca 
APPLICANT: Brooks-Walter, Alexis 

TITLE OF INVENTION: PNEUMOCOCCAL GENES, PORTIONS THEREOF, 

TITLE OF INVENTION: EXPRESSION PRODUCTS THEREFROM, AND USES OF SUCH GENES, 
TITLE OF INVENTION: PORTIONS AND PRODUCTS 
NUMBER OF SEQUENCES: 47 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Curtis, Morris & Safford, P.C. 
STREET: 530 Fifth Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S. 
ZIP: 10036 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/714,741 
FILING DATE: 16-SEP-1996 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Frommer Esq., William S. 
REGISTRATION NUMBER: 25,506 
REFERENCE/DOCKET NUMBER: 454312-2460 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 840-3333 
TELEFAX: (212) 840-0712 
INFORMATION FOR SEQ ID NO: 32: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 8991 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 



MOLECULE TYPE: amino acid 
US-08-714-741-32 



Query Match 5.2%; Score 302.5; DB 4; Length 8991; 

Best Local Similarity 22.1%; Pred. No. 1.6e-09; 

Matches 244; Conservative 125; Mismatches 439; Indels 297; Gaps 47; 

Qy 18 PRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAP 77 

I : I I I : : I I : I : : I I I III I : : I I I I 

Db 7805 PKPAPAPQPAPAPKPEKTDDQQAEEDYARRSE— EEYNRLPQQQ PPKAEKP 7853 

Qy 78 LLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPP 137 

: I I I : I I I I I I I I I I I I I I I I 

Db 7854 APAPKPEQPVPAEXPENP APAPK — PAXAPQPLKPEEPAEQP 7893 

Qy 138 ARPPPPPPAGASPLAEPAAPPSTPAAP — KRRGS G S VDET L FAL P AAS E P VI P S S AEKIM 195 

I III III I III: 1:1 I : I I 

Db 7894 KPEKPEEPAGQ PEPEKPDDQQAGEDYARRSGGEYNRFPQQQPPKAEK — PAPAPK- — 7946 

Qy 196 DLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSF KEHGYLGNLSAVSS 248 

III I II: I : I : I I III: 

Db 7947 -- PEQP VPAPKTLLKKAKLAGAKSKAATKKAELEPELEKAEAELENLLSTLD 7996 

Qy 249 SEGTIEETLNEASKELPERATNPFVNRDLAEFSEL — EYSEMGSSFKGSPKGESAILVEN 306 

II : I : I I I I II : I I I I I : : : I : : : 

Db 7997 PEG KTQDELDKEAAEAELNKKVEALPNQVSELEEELSKLEDNLKDAETNNVEDYIKE 8053 

Qy 307 TKEEVIVRSK DKEDLVCSAALHS PQES PVGKEDRWS PEKTMDI FNEMQMS WAPVR 363 

II I : : I I I I : : I I I : II II 
Db 8054 GLEEAIATKQAELEKTPKELDAALN— ELGPDGDEEETPPPE APAE 8097 

Qy 364 EEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED — SLEQKSLGKDSEGRN 421 

: : I I I : I I I I I I : I : I I : I : 

Db 8098 QPKPE-KPAEETPAPAPKPEKSADQQA EEDYARRSEEEYNRLTQQQPPKAEKPAP 8151 

Qy 422 EDASFPSTPEPVKDSSRAYITCAS FTSA TESTTANTFPLLEDHT 465 

I I I I II I I I I I I 

Db 8152 APAPKPEQPAPA-PKSRGLATKKKLNLAEARIELLLKKLGLEPGLEKAGAGLGNLLSTLD 8210 

Qy 466 SENKTDE KKIEERKAQI — ITEKTSPKTSNPFLVAVQDSEADYVTTDT 511 

III: I I : I | : : | : | I : : | : | : : | 

Db 8211 PEGKTQDELDKEAAEAELNKKVEALPNQVAELEEELSKLEDN LKDAETNHVEDYI 8265 

Qy 512 LSKVTEAAVSNMPE ■ GLTPDLVQE ACESELNEATGTKIAYET 552 

: I I : I I I I : I II: : I I I I 

Db 8266 KEGLEEAIATKQAELEKTPKELDAALNELGPDGDEEETPAPEAPAEQPKPEKPAEETPAP 8325 

Qy 553 KVDLVQTS EAI QES L YPTAQLC P S FEEAEAT P S PVL P D I VMEAP 596 

: I : I I I I I I I : I : I I : II 

Db 8326 APKPEKSADQQAEEDYARRSEEEYNRL — T QQQ P P KAEKP AP AP AP - K P EQ P AP AP KKKQ 8382 

Qy 597 LNSLLPSA GASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAM 641 

I : : I I II::: III : I I I I I : 

Db 8383 KVNLENLLSTLDPGGKTQDELDKGAAEAELNKKVEALPNPVX ELEEELSPPEDN — 8436 

Qy 642 NVALKALGT KEGIKE PES FN AAV QETEAPYIS 673 

II I I I I : : I I : : I I : : I I I I 



Db 8437 - — LKDAETNHVEDYIKEGLEEAIATKQAELEETPQEVDAALNDLVPDGGEEETPAP 8490 

Qy 674 IACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE LVEDSSPESE 720 

: : : I : I : : I I II : I I I : I : : I 

Db 8491 APQPDEPAPAPAPNAEQPAPAPKPEKSADQQAEEDYARRSEGEYNRLTQQQPPKAE 8546 

Qy 721 -PVDLFSDDSIPEVPQTQEEAVML MKESLTEVSETVAQHKEERLSASPQ 768 

I : I : I I ::: : I I : I : i : : I 

Db 8547 KPAPAPAPKPEQPAPAPNKEIARLQSDLKDAEENNVEDYIKEGLEQAITNKKAELATTQQ 8606 

Qy 769 ELGKPYLESFQPNLHSTKDAASNDIPTLTKKE — KISLQMEEFNTAI YSNDDLL 820 

: I : I I I : I I : I : : : : : I : : : I 

Db 8607 NIDKTQKDLEDAELELEKVLATLDPEGKTQDELDKEAAEAELNEKVEALQNQVAELEEEL 8666 

Qy 821 SSKEDKIKESETFSDSSPI-EIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSG 879 

I I I : | : : I I : I I : : I : I : : | | : | 
Db 8667 SKLEDNLKDAETNNVEDYIKEGLEE AIATKKAELEKTQKE 8706 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

I : : I : : I I : I I I I : I 
Db 8707 — LDAALNELGPDGD EEETPAPAPQPEKPAEEPEN-PAPAPKPE 8747 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLS 964 

II : : I I : : I : I I : 

Db 8748 --KSADQQAEEDYARRSEEEYNRLT 8770 



RESULT 9 

US-09-976-594-726 

; Sequence 726, Application US/09976594 

; Patent No. 6673549 

; GENERAL INFORMATION: 

; APPLICANT: Furness, Michael 

; APPLICANT: Buchbinder, Jenny 

; TITLE OF INVENTION: GENES EXPRESSED IN C3A LIVER CELL CULTURES TREATED WITH 
STEROIDS 

FILE REFERENCE: PA-0041 US 
; CURRENT APPLICATION NUMBER: US/09/976,594 
; CURRENT FILING DATE: 2001-10-12 
; PRIOR APPLICATION NUMBER: 60/240,409 
; PRIOR FILING DATE: 2000-10-12 
; NUMBER OF SEQ ID NOS: 1143 
; SOFTWARE: PERL Program 
; SEQ ID NO 726 
LENGTH: 2468 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature 

OTHER INFORMATION: Incyte ID No. 6673549 4295277CD1 
US-09-976-594-726 

Query Match 4.9%; Score 288; DB 4; Length 2468; 

Best Local Similarity 20.2%; Pred. No. 2e-09; 

Matches 251; Conservative 166; Mismatches 441; Indels 386; Gaps 56; 



Qy 13 STDSPPR -- PPPAFKYQFVTEPEDEEDEEE EED EEEDDED 50 



: I I I : I : : I I I : : : I : I : I I II I 

Db 625 ATDVKPKAAKEKTVKKETKVKPEDKKEEKEKPKKEVAKKEDKTPIKKEEKPKKEEVKKEV 684 

Qy 51 LEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWER 110 

: I : : I : I | : : : | : : : : : I : 
Db 685 KKEIKKEEKK EPKKEVKKETPPKEVKKEVKKEEKKEVKKE EK 726 

Qy 111 SPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRG — 168 

I III : I I : I I I I I : I I I : I 

Db 727 EPKKEIKKLPKDAKKSSTPLSEAKKP7VALKPKVPKKEESVKKDSVA AGKPKEKGKI 782 

Qy 169 SGSVDETLFAL PA AS EPVI P S S AEKIMDLME 199 

I I : I II I : : I : I I 

Db 783 KVI KKEGKAAEAVAAAVGTGATTAAVMAAAGI AAI GPAKELEAERS LMS S PEDLTKDFEE 842 

Qy 200 QPGNTVSSGQEDFPSV-LLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLN 258 

I :: I : hi I h :: : I I I I I 

Db 843 LKAEEVDVTKDIKPQLELIEDEEKLKETEPVEAYVIQKEREVTKGPAESPDEG-ITTTEG 901 

Qy 259 EASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTK-EEVIVRSKD 317 

I I I : : : : I : I : I : I I : I : I I : I 

Db 902 EGECEQTPEELEPVEKQGVDDIEKFE — DEGAGFEES — SETGDYEEKAETEEAEEPEED 957 

Qy 318 KEDLVC- SAALHS P QESPVGKEDRWSPEKTMDIFNEMQMSV 358 

I : I I I I : I I I Ml : II I : I I I 
Db 958 GEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRA EEDMD EAIEKG 1011 

Qy 359 VAPVREEYADFKPFEQAWEVK-DTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410 

I MM: : : | : : : | | : : | I : I I : I : 

Db 1012 EAEQSEEEADEE — DKAEDAREEEYEPEK— MEAEDYVMAWDKAAEAGGAEEQYGFLTT 1067 

Qy 411 — KSLGKDSEGR NEDASFPSTPEPVKDSSRAYITCASFTSATESTT 454 

I I I I I I : I : I I : I : : : I : I : 

Db 1068 PTKQLGAQSPGREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEIS 1127 

Qy 455 ANTFPL LEDHTSENKTDEK KIEERKAQIITEKTS-PKTSNPFLVAV 499 

: I : : | | : : | : | ::::::: | | | 

Db 1128 SEPTPMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPADVTPLNGFS 1187 

Qy 500 QDSEADYWTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDL 556 

: I : I II | : ^: | : | : | | : | : : : | : | : 
Db 1188 EGSKTD — ATDGKDYNASASTISPPSSMEED KFSRSALRDAYCSEVKASTTLDIKDS 1242 

Qy 557 — VQTSEAIQESLYPTAQLCP SFE EAEATPSPVLPDIV 592 

Ml: I I : I : I I I I : II I : : 

Db 1243 ISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFSLTPNEIKVSAEAEVAPVSPEVT 1302 

Qy 593 ME APLNSLLPSAG ASWQ 610 

I : I I : I I I I :: 

Db 1303 QEWEEHCASPEDKTLEWSPSQSVTGSAGHTPYYQSPTDEKSSHLPTEVIEKPPAVPVS 1362 

Qy 611 PSVSPLEAPPPVSYDSIK -- LEPENPPP YEEAMNVALKALG 649 

II I I :: I I I I : I I II I I : : Ml 

Db 1363 FEFSDAKDENERASVSPMDEPVPDSESPIEKVLSPLRSPPLIGSESAYESFLSADDKASG 1422 

Qy 650 TKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPD-FSNYSEIA KF 701 

: I M : II : I II I I I : I 



Db 1423 RGAESPF EEKSGKQGSPDQVSPVSEMTSTSLYQDKQ 1458 



Qy 702 EKSVPEHAELVEDSSPE SEPVDLFSDDSIPEVPQTQ EEAV 741 

I : | : | | | | : | : : : I I I I : 

Db 1459 EGKSTDFAPI KEDFGQEKKTDDVEAMS SQPALALDERKLGDVS PTQI DVSQFGS FKEDTK 1518 

Qy 742 MLMKE SLTEVSETVAQ HKEERLSASPQELG KPYLESFQPNLHSTK 786 

I : I I I I I I I : II II : : I : I : I I 
Db 1519 MS I S EGTVS DKS AT P VDEGVAEDT YSHMEGVASVSTAS VAT S S FPEPTTDDVS P S LH 1575 

Qy 787 DAASNDIPTLTK-KEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEF 845 

I | | : : : | : : : | : : : | | | : : 
Db 1576 --AEVGSPHSTEVDDSLSVSWQTPTT-FQETEMSPSKEECPR 1615 

Qy 846 PTFVSAKDDS PKLAKEYTDLE — VSDKSEIANIQSGADSLPCLELPCDLSFKN 896 

I : I I I I I I I I :: | :: | : : | : | : | I I I I :: 
Db 1616 PMSISPPDFSPKTAKSRTPVQDHRSEQSSM-SIEFGQES-PEQSLAMDFSRQSPDHPTVG 1673 

Qy 897 IYPKDE-VHVSDEFSENRSSVSKASISPSNV 926 

I I : I : I III: III 

Db 1674 AGVLHITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTS 1733 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE 97 0 

I I I : : : I : : : |': I : I I I I I : : I 

Db 1734 SAHTP-SQIASPLQEDTLSDVAPPR DMSLYASLTSE 17 68 



RESULT 10 
US-09-149-476-411 

Sequence 411, Application US/09149476 
Patent No. 6420526 
GENERAL INFORMATION: 
APPLICANT: Rosen et al.

TITLE OF INVENTION: 186 Human Secreted proteins
FILE REFERENCE: PZ002P1 

CURRENT APPLICATION NUMBER: US/09/149,476 
CURRENT FILING DATE: 1998-09-08 
EARLIER APPLICATION NUMBER: PCT/US98/04493 
EARLIER FILING DATE: 1998-03-06 
EARLIER APPLICATION NUMBER: 60/040,162 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,333 
EARLIER FILING DATE: 1997-03-07
EARLIER APPLICATION NUMBER: 60/038,621 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,626 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,334 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,336 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 
EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 



EARLIER FILING DATE: 1997-04-11
EARLIER APPLICATION NUMBER: 60/043,313
EARLIER FILING DATE: 1997-04-11
EARLIER APPLICATION NUMBER: 60/043,672
EARLIER FILING DATE: 1997-04-11
EARLIER APPLICATION NUMBER: 60/043,315
EARLIER FILING DATE: 1997-04-11
EARLIER APPLICATION NUMBER: 60/048,974
EARLIER FILING DATE: 1997-06-06
EARLIER APPLICATION NUMBER: 60/056,886
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,877
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,889
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,893
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,630
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,878
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,662
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,872
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,882
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,637
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,903
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,888
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,879
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,880
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,894
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,911
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,636
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,874
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,910
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,864
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,631
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,845
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,892
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/057,761
EARLIER FILING DATE: 1997-08-22



EARLIER APPLICATION NUMBER: 60/047,595
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,599
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,588
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,585
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,586
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,590
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,594
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,589
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,593
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/047,614
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/043,578
EARLIER FILING DATE: 1997-04-11
EARLIER APPLICATION NUMBER: 60/043,576
EARLIER FILING DATE: 1997-04-11
EARLIER APPLICATION NUMBER: 60/047,501
EARLIER FILING DATE: 1997-05-23
EARLIER APPLICATION NUMBER: 60/043,670
EARLIER FILING DATE: 1997-04-11
EARLIER APPLICATION NUMBER: 60/056,632
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,664
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,876
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,881
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,909
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,875
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,862
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,887
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/056,908
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/048,964
EARLIER FILING DATE: 1997-06-06
EARLIER APPLICATION NUMBER: 60/057,650
EARLIER FILING DATE: 1997-09-05
EARLIER APPLICATION NUMBER: 60/056,884
EARLIER FILING DATE: 1997-08-22
EARLIER APPLICATION NUMBER: 60/057,669
EARLIER FILING DATE: 1997-09-05
EARLIER APPLICATION NUMBER: 60/049,610
EARLIER FILING DATE: 1997-06-13
EARLIER APPLICATION NUMBER: 60/061,060
EARLIER FILING DATE: 1997-10-02



Query Match 4.9%; Score 286; DB 4; Length 92; 

Best Local Similarity 56.0%; Pred. No. 2.2e-11;

Matches 51; Conservative 21; Mismatches 19; Indels 0; Gaps 0;

Qy 1064 NSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIP 1123 

I : I : I : I : I : I I I I I : I I I I I I I II I I : I I I I I : I I I : I I I I I I - : I I : I 
Db 2 NAAMVHINRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAELLIFSVP 61 

Qy 1124 VIYERHQVQIDHYLGLANKSVKDAMAKIQAK 1154

: : I I : : : I I I I I : I : I I : I I : I 

Db 62 IVYEKYKTQIDHYVGIARDQTKSIVEKIPSK 92



RESULT 11 
US-08-973-462-8 

Sequence 8, Application US/08973462B 
Patent No. 6191270 
GENERAL INFORMATION: 
APPLICANT: DRUILHE, PIERRE 
APPLICANT: DAUBERSIES, PIERRE

TITLE OF INVENTION: MALARIAL PRE-ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES
FILE REFERENCE: 0660-0125-0 PCT
CURRENT APPLICATION NUMBER: US/08/973,462B
CURRENT FILING DATE: 1998-02-06
EARLIER APPLICATION NUMBER: PCT/FR96/00894
EARLIER FILING DATE: 1996-06-12
EARLIER APPLICATION NUMBER: FR 95/07007
EARLIER FILING DATE: 1995-06-13
NUMBER OF SEQ ID NOS: 29
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 8 
LENGTH: 1786 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Description of Artificial Sequence: Polypeptide
US-08-973-462-8 

Query Match 4.8%; Score 279.5; DB 3; Length 1786; 

Best Local Similarity 20.1%; Pred. No. 4.4e-09; 

Matches 261; Conservative 220; Mismatches 518; Indels 297; Gaps 60; 

Qy 33 EDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDS VP 87 

I I : Mill : I : : I I II I I : : I : I I II I 

Db 225 EKVEESVEENDEESVEENVEE-NVEENDDG SVASSVEESIASSVDESIDSSIEENVA 280 

Qy 88 P AP RGP L PAAP P AAP ERQ P S WERS PA APAPSLPPAAAV-LPSKLPEDDE 135 

I II II : : | | I : | : : : | : : | : | 

Db 281 PTVEEIVAPSVVESVAPSVEESVEENVEESVAENVEESVAENVEESVAENVEESVAENVE 340 

Qy 136 PPARP PPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPS 189 

I I : I : | | : : I I I : : : I I I 

Db 341 EIVAPTVEEIVAPTVEEIVAPSWESVAPSVEESVEENVEESVAENV— EESVAENVEES 398 

Qy 190 SAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASL — P S L S P L S TVS FKEHG YLGNL S AVS 247 



Db 399 VAENVEE S VAE — NVE E S VAENVE E I VAP T VE E I VAP T VE E I VAP S WE SVAP 44 9 

Qy 248 SSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENT .307 

I I : : I I : I : I I : I : I I I 1 I : I I 

Db 450 SVEESVEENVEESVAENVEESVAENVEESVAENVE ES — VAENV 491 

Qy 308 KEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYA 367 

: I I : : : I : I : : I I : : I : I I : : : I I I II 

Db 4 92 EESV AENVEESVA ENVE E S VAEN VE E I VAP — TVE EIVAPTVEEIV 535 

Qy 368 DFKPFEQ-AWEVKDTYEGSRDVIAARANVESKVDRKCLEDSLEQKSLGKDSEGRN-EDAS 425 

I I | : : : I : : I III I : I : I : : : : I I I : 

Db 536 AP S WE S VAP S VE E S VEENA^E E S VAE- NVE ES VAEN -VEES VAENV — E E S VAENVE E I V 591 

Qy 426 FPSTPEPVKDSSRAYITCAS FT SATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITE 485 

I : I I : : III I : I : II . : | | | : : I 
Db 592 AP T VEE I VAP TVE E I VA PSWESVA PSVEESVEEN VEES VAENVEE 637 

Qy 4 86 KTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATG 54 5 

: I:: I I II : : I : I I : I : I : I : 

Db 638 SVAENVEESVAENVEEIVAPTVEEIVAPTVEEIVAPSWESVAPS-VEESVEENVEESVA 696 

Qy 546 TKIAYETKVDLVQTS — EAIQESLYPTAQ — LCPSFEEAEATPSPVLPDIVMEAPLNSLL 601 

: I : : I : I I : : I I : : : I : I I I I : : II : : I : 
Db 697 ENVE- E S VAEN VE E S VAEN VEE S VAENVE E I VAPT VE E I VA PTVEEIVAPSWESVA 752 

Qy 602 PSAGASV — VQPSVSPLEAPPPVSYDSIKLE 630 

II II | | : | : | | : | 

Db 753 P S VEES VEENVEESVAENVEES VAENVEE S VAENVEE S VAPT VEE I VAP SVE 804 

Qy 631 PENPPPYEEAM NVALKALGT KEGI KEP ES FNAAVQETEAP YI S I ACDLI K 680 

III:: I : | | | : : | , : : I : : : : : : 

Db 805 ESVAPSVEESVAENVATNLSDNLLSNLLGGIETEEIKDSILNEIEEVKENWTTILENVE 864 

Qy 681 ETKLSTEPSPDFSNYSEIAK FEKSVPEHAELV EDSSPESEPVDLF-- 725 

II : I III I : I : : I I I | : : | | : : 

Db 865 ET — TAESVTTFSNILEEIQENTITNDTIEEKLEELHENVLSAALENTQSEEEKKEVIDV 922 

Qy 726 SDDS I PEVPQTQEEAVMLMKES LTE VSETV 755 

I : : I I : : II : I : : I I II 

Db 923 IEEVKEEVATTLIETVEQ7VEEKSANTITEIFENLEENAVESNENVAENLEKLNETVFNTV 982 

Qy 756 AQHKEERLSASPQELGKPYLES— FQPNLHSTKDAASNDIPTLTK--KEKISLQMEE 808 

I I : I : I : : I : I | : -: : : I : I I I 

Db 983 LDKVEETVEISGESLENNEMDKAFFSEIFDNVKGIQENLLTGMFRSIETSIVIQSEEKVD 1042 

Qy 809 FNTAIYSN— DDLLSSKEDKIKESETFSDSSPI-EIIDEF PTFVS 850 

I : I : I :: : I I : : I I : : I : I I : 

Db 1043 LNENWSSILDNIENMKEGLLNKLENISSTEGVQETVTEHVEQNVYVDVDVPAMKDQFLG 1102 

Qy 851 AKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEVHVSDEFS 910 

::: I : : : I I III : I : I : : I : I : I : : I 

Db 1103 ILNEAGGLKEMFFNLEDVFKSE S DVI TVEEI KDEPVQKEV-EKETVS 1 1 EEME 1154 

Qy 911 ENRSSV SKASISPSNVSALEPQTEMGSIVKSKSLT-KEAEKKLPSDTEK EDR 961 

II I I : : : I : I I : I I : : : I : I I : I : ■ I 



Db 1155 ENIVDVLEEEKEDLTDKMIDAVEESIEISSDSKEETESIKDKEKDVSLWEEVQDNDMDE 1214



Qy 962 SLSAVL SAELS KTSV-VDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI A 1015 

I : II II I : I : : : : I : : | : I : : 

Db 1215 SVEKVLELKNMEEELMKDAVEINDITSKLIEETQELNEVEADLIKDME K 1263 

Qy 1016 LALLSVTISFRIYKGVIQA IQK- S DEGHPFRAYLESEVA 1 SEELVQKYSN — S 1065 

II :| I :| I ::| :| | |: | : |: ::| |: 

Db 1264 LKELEKALS-EDSKEIIDAKDDTLEKVIEEEHDITTTLDEWELKDVEEDKIEKVSDLKD 1322 

Qy 1066 ALGHWSTIKELRRLFLVDDLVDSLKFAVLMm/FTWGALFNGLT^ 1125 

: : I I : : I : : : : I : : : 

Db 1323 LEEDILKEVKEIKE— LESEILEDYK ELKTIETDIL 1356 

Qy 1126 YERHQVQI DHYLGLANKS — VKDAMAKIQAKIPGLK 1159 

I: ::: I |: :: :| I II :: I : 

Db 1357 EEKKEIEKDHFEKFEEEAEEIKDLEADILKEVSSLE 1392 



RESULT 12 
US-08-769-309A-5 

; Sequence 5, Application US/08769309A
; Patent No. 5741890
; GENERAL INFORMATION:
; APPLICANT: Scott, John D.,
; APPLICANT: Nauert, Brian J.,
; APPLICANT: Klauck, Theresa M.
; TITLE OF INVENTION: Protein Binding Domains of Gravin
; NUMBER OF SEQUENCES: 24
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun
; STREET: 6300 Sears Tower/233 South Wacker Drive
; CITY: Chicago
; STATE: Illinois
; COUNTRY: United States of America
; ZIP: 60606-6402
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/769,309A
; FILING DATE:
; CLASSIFICATION: 435
; ATTORNEY/AGENT INFORMATION:
; NAME: Noland, Greta E.
; REGISTRATION NUMBER: 35,302
; REFERENCE/DOCKET NUMBER: 27866/33451
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 312-474-6300
; TELEFAX: 312-474-0448
; TELEX: 25-3856
; INFORMATION FOR SEQ ID NO: 5:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 1780 amino acids
; TYPE: amino acid
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-08-769-309A-5 

Query Match 4.5%; Score 265.5; DB 1; Length 1780; 

Best Local Similarity 20.8%; Pred. No. 3.4e-08; 

Matches 264; Conservative 146; Mismatches 486; Indels 371; Gaps 51; 

Qy 11 SSSTDSPPRPPPA FKYQFVTE PEDE EDEEEEEDEEED 47 

I I : I I I : 11:11: III I : : I : | | : | 

Db 277 SKSAESPTSPVTSETGSTFK-KFFTQGWAGWRKKTSFRKPKEDEVEASEKKKEQEPEKVD 335 

Qy 48 DEDLEELEVLERKPAAGLSAAAVPPAAAA — PLLDFSSDSVPPAPRGPLPAAPPAAPERQ 105 

I : : I I I I I I I : I I I : I I I : I I 

Db 33 6 TEEDGKAEVASEKLTASEQAHPQEPAESAHEPRLSAEYEKV ELPS EEQ 383 

Qy 106 PSWERSPAAPAPSLPPAAAVLPSKLP EDDEPPARPPPPPPAGASP 150 

1:1: I: I I I h |: : I |: I 

Db 384 VSGSQGPSEEKPA-PLATEVFDEKIEVHQEEWAEVHVSTVEERTEEQKTEVEETAGSVP 442 

Qy 151 L AEP — AAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQP 201 

I I I I I I II I : II: I : : : I 

Db 443 AEELVGMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLSKPPEGWSEVEML 502 

Qy 202 GNTVS S GQEDFP S VLLETAAS LP S LS PLSTVS FKEHGYLG NLSAVSSSEGTI 253 

: : I I I : I I I I : I I : : I : 

Db 503 SSQERMKVQGSPLKKLFTSTGLKKLS GKKQKGKRGGGDEESGEHTQVPADSPDSQ 557 

Qy 254 EETLNEASKELPERATN-PFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVI 312 

II I : I II : : | | | : : I I :: I I I : I : I 

Db 558 EEQKGESSASSPEEPEEITCLEKGLAEVQQDGE7VEEGATSDGEKKREGVTPWASFKKMVT 617 

Qy 313 VRSK DKE D L VC S AALH S P Q E S P VGKE D RWS P E KTMD I FN EMQMS WAP V 362 

: : III I I I I I I : : : : I I : I I I 

Db 618 PKKRVRRPSESDKEDELDKVKSATLSSTEST ' — ASEMQEEMKGSVEEPK 664 

Qy 363 REEYADFKPFEQAWEV KDTYEGS 385 

M =M || |: 

Db 665 PEEPKRKVDTSVSWEALICVGSSKKRARRRSSSDEEGGPKAMGGDHQKADEAGKDKETGT 72 4 

Qy 38 6 RDVLA -ARANVESK VDRKCLEDSLEQKSLGKDS 417 

: I I I : I : II : I I : I I : I I 

Db 725 DGILAGSQEHDPGQGSSSPEQAGSPTEGEGVSTWESFKRLVTPRKKSKSKLEEKS — EDS 782 

Qy 418 -EGRNEDASFPSTPEPVKDSS RAY I T CAS FT SAT E — ST 453 

I : I I I I I I: I :| : I III I 

Db 783 IAGSGVEHSTPDT-EPGKEESWVSIKKFIPGRRKKRPDGKQEQAPVEDAGPTGANEDDSD 841 

Qy 454 TANTFPLLEDHTSENKTDE KKIEERKAQIITEKTSPKTSNPFLVAVQD 501 

III I : I : : | : : | : : : : | : | | | 

Db 842 VP AWP L S E YDAVE RE KMEAQQAQ KGAEQ P EQ KAAT EVS KE L S E S QVHMMAAAVADGT RA 901 

Qy 502 SEADYVTTDTLSK VTEAAVSNMPEG- 526 

: I I : I : I : III : I I 

Db 902 ATIIEERSPSWISASVTEPLEQVEAEAALLTEEVLEREVIAEEEPPTVTEP LPENR 957 

Qy 527 LT PD LVQ EA- C E S E LNEAT GT K I - AYET KVD L VQT S EAI QE S L YP TAQ 572 



pb 958 EARGDTWSEAELTPEAVTAAETAGPLGSEEGTEASAAEETTEMVSAVSQLTDSPDTTEE 1017 

Qy 573 L CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVS 623 

I I I I II:: I I I I I : I I 

Db 1018 ATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTG GPEDVLQPVQ 1069 

Qy 624 YDSIKLEPENPPPYEEA MNVALK ALGTKEGIKEPESFNAAVQE 666 

: I I I II : I I I I I I I I I I I 

Db 1070 RAEAERPEEQ7VEASGLKKETDWLKVDAQEAKTEPFTQGKWGQTTPESFEKAPQV 1125 

Qy 667 TEAPYISIACDLIKETKLSTEPSPD — FSNYSEIAKFEKSVPEHAELVEDSSPES — EPV 722 

II: I : : : I I : I : I : : : I I : I I 

Db 1126 TES IESSELVTTCQAETLAGVKSQEMVMEQAIP PDSVETPT 1166 

Qy 723 DLFSDDSIP EVPQTQEEAVMLMKESLTEVSETVAQH KEERLSAS PQELGKP Y 774 

I : I I I : I I : I : : I : I I : I I : : I 

Db 1167 DSETDGSTPVADFDAPGTTQ KDEIVEIHEENEVHLVPVRGTEAEAVPAQKERPP 122 0 

Qy 775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

I II: : I : : I : : : I I : : I : : : : 

Db 1221 APSSFVFQEETKE QSKMEDTLEHTDKEVSVETVSILSKTEGTQEADQYA 1269 

Qy 835 DSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 8 85 

I :: II I I III: I : : I I I I I 

Db 1270 DEKTKDVPFFEGLEGSIDTGITVSREKVTEVALKGEGTEEAECKKDDALELQSHAKSPPS 1329 

Qy 886 LELPCDLSF KNIYPKDEVHVSDEFSENRSS VSKASI SPSNVSALEPQTE 934 

I : : : | | : : | | : : : III: I I : : I 

Db 1330 PVEREMWQVEREKTEAEPTHWEEKLEHETAVTVS EEVS KQLLQT VNVP 1 1 DGAKE 1386 

Qy 935 MGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWF — 992 

: I : I | : : : : : I I : I : | | : I : I I 

Db 1387 VSSLEGSPPPCLGQEEAVCTKIQVQSSEAS FTLTAAAEEEKV — LGETANILETGETLEP 1444 

Qy 993 -GASLFL 998 

I I I I 

Db 1445 AGAHLVL 1451 



RESULT 13 
US-08-994-570-5 

; Sequence 5, Application US/08994570
; Patent No. 6090929
; GENERAL INFORMATION:
; APPLICANT: Scott, John D.,
; APPLICANT: Nauert, Brian J.,
; APPLICANT: Klauck, Theresa M.
; TITLE OF INVENTION: Protein Binding Domains of Gravin
; NUMBER OF SEQUENCES: 24
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun
; STREET: 6300 Sears Tower/233 South Wacker Drive
; CITY: Chicago
; STATE: Illinois
; COUNTRY: United States of America
; ZIP: 60606-6402
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/994,570
; FILING DATE:
; CLASSIFICATION:
; ATTORNEY/AGENT INFORMATION:
; NAME: Noland, Greta E.
; REGISTRATION NUMBER: 35,302
; REFERENCE/DOCKET NUMBER: 27866/33451
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 312-474-6300
; TELEFAX: 312-474-0448
; TELEX: 25-3856
; INFORMATION FOR SEQ ID NO: 5:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 1780 amino acids
; TYPE: amino acid
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-08-994-570-5 

Query Match 4.5%; Score 265.5; DB 3; Length 1780; 

Best Local Similarity 20.8%; Pred. No. 3.4e-08; 

Matches 264; Conservative 146; Mismatches 486; Indels 371; Gaps 51; 

Qy 11 SSSTDSPPRPPPA FKYQFVTE PEDE EDEEEEEDEEED 47 

I I : I I I : II : I I : III I : : I : I I : I 

Db 277 SKSAESPTSPVTSETGSTFK-KFFTQGWAGWRKKTSFRKPKEDEVEASEKKKEQEPEKVD 335 

Qy 4 8 DEDLEELEVLERKPAAGLSAAAVPPAAAA — PLLDFSSD S VP PAP RG P L P AAP P AAP ERQ 105 

I : : I I II I I I : I I I : I II: II 
Db 336 T EE DGKAEVAS EKLT AS EQAH PQE P AE S AHE P RL S AE YEKV ELPS EEQ 383 

Qy 106 PSWERSPAAPAPSLPPAAAVLPSKLP EDDEPPARPPPPPPAGASP 150 

I : I : I : I I I I : I : : I I : I 

Db 384 VSGSQGPSEEKPA-PLATEVFDEKIEVHQEEWAEVHVSTVEERTEEQKTEVEETAGSVP 442 

Qy 151 L AE P — -AAP P S T P AAP KRRGS GS VDET L FAL P AAS E P VI P S S AEKIMDLMEQ P 201 

III I I I | | | : | | : I : : : I 

Db 443 AEELVGMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLSKPPEGWSEVEML 502 

Qy 2 02 GNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLG NLSAVSSSEGTI 253 

: : I I I : I I I I : I I : : I : 

Db 503 SSQERMKVQGSPLKKLFTSTGLKKLS GKKQKGKRGGGDEESGEHTQVPADSPDSQ 557 

Qy 254 EETLNEASKELPERATN-PFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVI 312 

II I : I II : : | M : : I I : : I I I : I : I 

Db 558 EEQKGESSAS S PEEPEEI TCLEKGLAEVQQDGEAEEGATSDGEKKREGVTPWAS FKKMVT 617 

Qy 313 VRSK DKE DLVCSAALHS PQES PVGKEDRWS PEKTMM FNEMQMSWAPV 362 

: : III | | | | I | : : :: ||: II I 

Db 618 PKKRVRRPSESDKEDELDKVKSATLSSTEST ASEMQEEMKGSVEEPK 664 



Qy 3 63 REEYADFKPFEQAWEV KDTYEGS 385 

II : I I III: 

Db 665 PEE PKRKVDTSVS WEAL I CVGS SKKRARRRS S SDEEGGPKAMGGDHQKADEAGKDKETGT 724 

Qy 386 RDVLA ARANVESK VDRKCLEDSLEQKSLGKDS 417 

: II I : I : I I : I I : I I : I I 

Db 725 DGILAGSQEHDPGQGSSSPEQAGSPTEGEGVSTWESFKRLVTPRKKSKSKLEEKS — EDS 782 

Qy 418 -EGRNEDASFPSTPEPVKDSS RAY I T CAS FT SAT E — ST 453 

I : I I I II I: I :| : I III I 

Db 783 IAGSGVEHSTPDT-EPGKEESWVSIKKFIPGRRKKRPDGKQEQAPVEDAGPTGANEDDSD 841 

Qy 454 TANTFPLLEDHTSENKTDE KKIEERKAQIITEKTSPKTSNPFLVAVQD 501 

III I : I :: I:: I :::: I : III 

Db 842 VP AWP L S E YDAVE RE KMEAQQAQ KGAEQ P EQ KAAT EVS KE L S E S QVHMMAAAVAD GT RA 901 

Qy 502 SEADYVTTDTLSK VTEAAVSNMPEG- 526 

: I I : I : I III : I I 

Db 902 ATIIEERSPSWISASWEPLEQVEAEAALLTEEVLEREVIAEEEPPTVTEP LPENR 957 

Qy 527 LTPDLVQEA-CESELNEATGTKI-AYETKVDLVQTSEAIQESLYPTAQ 572 

I I I : I I I I I : I I : : I : : I I : 

Db 958 EARGDTWSEAELTPEAVTAAETAGPLGSEEGTEASAAEETTEMVSAVSQLTDSPDTTEE 1017 

Qy 573 L CPS FE EAEAT P S P VL P D I VMEAP LN S L L P S AGAS WQ P S VS P L EAP P P VS 623 

I I I I II:: I I I I I : II 

Db 1018 ATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTG GPEDVLQPVQ 1069 

Qy 624 YDSIKLEPENPPPYEEA MNVALK ALGTKEGIKEPESFNAAVQE 666 

: I I I II : I I I I I I I I I I I 

Db 1070 RAEAERPEEQAEASGLKKETDWLKVDAQEAKTEPFTQGKWGQTTPESFEKAPQV 1125 

Qy 667 TEAPYISIACDLIKETKLSTEPSPD — FSNYSEIAKFEKSVPEHAELVEDSSPES — EPV 722 

II: I : : : I I : I : I : : : I I : I I 

Db 1126 TES IESSELVTTCQAETLAGVKSQEMVMEQAIP PDSVETPT 1166 

Qy 723 DLFSDDSIP EVPQTQEEAVMLMKESLTEVSETVAQH KEERLSAS PQELGKPY 774 

I : I I I : I I : I : : I : I I : I I : : I 

Db 1167 DSETDGSTPVADFDAPGTTQ KDEIVEIHEENEVHLVPVRGTEAEAVPAQKERPP 1220 

Qy 775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

I II: : I : : I : : : | | : : | : : : : 

Db 1221 APSSFVFQEETKE QSKMEDTLEHTDKEVSVETVSILSKTEGTQEADQYA 1269 

Qy 835 DSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 885 

I : : II I I III: I : : I I I I I 

Db 1270 DEKTKDVPFFEGLEGSIDTGITVSREKVTEVALKGEGTEETVECKKDDAiELQSHAKSPPS 1329 

Qy 886 LELPCDLSF KNIYPKDEVHVSDEFSENRSS VSKASISPSNVSALEPQTE 934 

I : : : | | : : | | : : : III: I I : : I 

Db 1330 PVE REMWQ VE RE KT EAE P T H VN E E KLEH ET AVT VS E EVS KQ L LQT VN VP 1 1 D GAKE 1386 

Qy 935 MGSIVRSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWF — 992 

: I : I | : : : : : I hi : I I : I : I I 

Db 1387 VSSLEGSPPPCLGQEEAVCTKIQVQSSEASFTLTAAAEEEKV — LGETANILETGETLEP 1444 

Qy 993 -GASLFL 998 



Db 1445 AGAHLVL 1451 



RESULT 14 

US-09-134-001C-4463 

; Sequence 4463, Application US/09134001C
; Patent No. 6380370
; GENERAL INFORMATION:
; APPLICANT: Lynn Doucette-Stamm et al
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO STAPHYLOCOCCUS
; TITLE OF INVENTION: EPIDERMIDIS FOR DIAGNOSTICS AND THERAPEUTICS
; FILE REFERENCE: GTC-007
; CURRENT APPLICATION NUMBER: US/09/134,001C
; CURRENT FILING DATE: 1998-08-13
; PRIOR APPLICATION NUMBER: US 60/064,964
; PRIOR FILING DATE: 1997-11-08
; PRIOR APPLICATION NUMBER: US 60/055,779
; PRIOR FILING DATE: 1997-08-14
; NUMBER OF SEQ ID NOS: 5674
; SEQ ID NO 4463
; LENGTH: 2137
; TYPE: PRT
; ORGANISM: Staphylococcus epidermidis
US-09-134-001C-4463 

Query Match 4.5%; Score 265; DB 4; Length 2137; 

Best Local Similarity 19.6%; Pred. No. 4.8e-08; 

Matches 172; Conservative 141; Mismatches 451; Indels 114; Gaps 24; 

Qy 159 STPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLE 218 

II: III : I : I : I : : I : I : ' I 

Db 990 STSTSTSDSASTSTSE SESDSASTSLSESTSTSVSDSTSTSTSDSASMSASESE 1043 



Qy 219 TAASLPSLSPLSTVSFKEHGYLGNLSAV SSSEGTIEETLNEASKELPERATNPFVNR 275 

: :. I I I : : I I : I I I : I I I : I I I : 

Db 1044 SNSKSTSLSESTSTSLS GSTSASTSDSASTSTSESESDSTSTSLSESTSTSLSGS 1098 



Qy 276 DLAEFSELEYSEMGS SFKGS PKGESAI LVE NTKEEVI VRS KDKEDLVC 323 

II: I I : : I I I I : | | : : | 

Db 1099 TSASTSD SASTSTSESDSTSESTSLSESLSTSVSDSTSASTSESASTSTSESESNSA 1155 



Qy 324 SAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYE 383 

|:| : : : : : I : : I : ": I :. 

Db 1156 STSLSGSLSTSISDSTSTSTSDSASTSTSESESDSTSTSLSE STSTSLSDSTS 1208 

Qy 384 GSRDVLAARANVESKVDRKCLEDSLEQK SLGKDSEGRNEDASFPSTPEPVKDSSRAY 440 

I I : : II I II: I : : I : : I I : I : 

Db 1209 TSTSESASTSTSES— DSTSESTSLSESTSTSVSDSTSASTSDSASTSTSVSDSESASTS 1266 



Qy 441 ITCASFTSATESTTANTFPLLEDHTSE — NKTDEKKIEERKAQIITEKTSPKTSNPFLVA 4 98 

I : : I I : : I I : : I III.::: : I : : : : I I I I : : 

Db 1267 ISESXSTSVSDSTSTSTSDSASTSTSESDSTSESTSLSESISTSVSDSTSASTSDSASTS 1326 

Qy 4 99 VQDSEADYVTTD TLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYE 551 

: I I : I : I : I I I : I : I : I : I : : I I : : 



Db 1327 TSESESDSASTSLSGSTSTSLSDSTSTSTSDSASTSTSESDSERASTSLSGSTSTSLSDS 1386



Qy 552 TKV DLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASV 608 

I I I ! : : : I I : I I I : : I I I I I 

Db 1387 TSTSTSDSASTSTSVSDS NSASTSLSGSLSTSVSDSTSTSTSDSASAST 1435 

Qy 609 VQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK-ALGTKEGIKEPESFNAAVQET 667 

III II :: : : : I : | I : :: I : 

Db 1436 SESDSERA STSLSGSTSTSISDSTSTSTSDSASTSTSVSESNSTSTSISES 1486 

Qy 668 EAPYI SIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSD 727 

: : I III | : : : :. : : I I I : : I I : 

Db 1487 LSTSVS DSTSTSTSDSASTSTSVSDSDSASTSSSESV — STSDSESTSTSTS 1536 

Qy 728 DSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQE LGKPYLESFQ 779 

II I : : : I : I I I : : : : III II 

Db 1537 DSASTSTSVSESNSTSTSLSGSTSTSVSDSTSTSTSDSASASTSESDSDSASTSSSESVS 1596 

Qy 780 PNLH STKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

: : I I : : I I : I : I I I I : I 

Db 1597 TSVSDST-SASTSESASTSTSVSDSNSASTSLSESTSTSLSDSTSMSTSDSASTSTSESDS 1656 

Qy 835 DSSPIEIID EFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADS 882 

II: : I INI:: : : I I : : : I I : I 

Db 1657 DSASTSLSDSTSTSVSESTSTSTSTSVSASNSTSTSLSDSRSTSLSDSTSTSTSESGSTS 1716 

Qy 883 LPCLELPCDLSFKNIYPKDEVHVSDEFSENR — SSVSKASISPSNVSALEPQTEMGSIVK 940 

I I : :: : I I I : I : : I : I I I : : : I 

Db 1717 TS — ESDSDSASTSLSESTSTSISDSTSTSTSDSASTSMSVSDSNRASTSLSDSTSTSVS 1774 

Qy 941 SKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWD 97 8 

: : I I II II 111:1111 

Db 1775 DSTSASTSESASTSTRESESTSASTSLS-ESTSTSVSD 1811 



RESULT 15 

US-09-621-976-4600 

; Sequence 4600, Application US/09621976
; Patent No. 6639063
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Jobert, S.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: ESTs and Encoded Human Proteins.
; FILE REFERENCE: GENSET.054PR2
; CURRENT APPLICATION NUMBER: US/09/621,976
; CURRENT FILING DATE: 2000-07-21
; NUMBER OF SEQ ID NOS: 19335
; SOFTWARE: Patent.pm
; SEQ ID NO 4600
; LENGTH: 75
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 58
; OTHER INFORMATION: Xaa = His,Pro
; NAME/KEY: UNSURE
; LOCATION: 28
; OTHER INFORMATION: Xaa = Met,Val
; NAME/KEY: UNSURE
; LOCATION: 19
; OTHER INFORMATION: Xaa = Pro,Gln
; NAME/KEY: UNSURE
; LOCATION: 53
; OTHER INFORMATION: Xaa = Ser,Tyr
US-09-621-976-4600 

Query Match 4.4%; Score 256.5; DB 4; Length 75; 

Best Local Similarity 67.0%; Pred. No. 1.2e-09; 

Matches 59; Conservative 3; Mismatches 11; Indels 15; Gaps 3; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I II I I II II I I 

Db 1 MEDLDQS PLVS S S- DS P PRXQPAFKYQFXREPEDEE EDLEELEVLERK 47 

Qy 61 PAAGLSAAAV — PPAAAAPLLDFSSDSV 86 

I I I I I I I I 111111:11:11 

Db 48 PAAGLXAAP VXT APAAGAP LMD FGND FV 75 



Search completed: September 3, 2004, 16:10:34 
Job time: 28.9601 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: September 3, 2004, 16:00:14 ; Search time 24.8344 Seconds 

(without alignments) 
4504.667 Million cell updates/sec 

Title: US-09-830-972-2
Perfect score: 5848
Sequence: 1 MEDIDQSSLVSSSTDSPPRP ... VKDAMAKIQAKIPGLKRKAD 1163

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 283366 seqs, 96191526 residues

Total number of hits satisfying chosen parameters: 283366

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database: PIR_78:*
pir1:*
pir2:*
pir3:*
pir4:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
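
The percentages and throughput figure printed in this report appear to follow from the header values by simple arithmetic. The sketch below is not part of the GenCore output: the function names are hypothetical, and the formulas are assumptions inferred from the numbers in this listing (Score 789.5 against Perfect score 5848 is reported as Query Match 13.5%, and 243 matches over 779 alignment columns as Best Local Similarity 31.2%). Pred. No. is not reproduced, since it depends on the total score distribution, which the report does not include.

```python
# Hypothetical helpers; the formulas are assumptions checked only against
# the figures printed in this listing, not taken from GenCore documentation.

def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's self-alignment (perfect) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float) -> float:
    """Dynamic-programming cells (query x database residues) per second, in millions."""
    return query_len * db_residues / seconds / 1e6


if __name__ == "__main__":
    # Values copied from the PIR search header and its RESULT 1 statistics.
    print(round(query_match_pct(789.5, 5848), 1))
    print(round(best_local_similarity_pct(243, 115, 252, 169), 1))
    print(round(million_cell_updates_per_sec(1163, 96191526, 24.8344), 3))
```

The same functions can be applied to any other result in the listing as a quick consistency check on figures that may have been damaged in transcription.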
ALIGNMENTS



RESULT 1 
A46583 

neuroendocrine-specific protein, splice form A - human
N;Contains: neuroendocrine-specific protein, splice form B
C;Species: Homo sapiens (man)
C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999
C;Accession: A46583; I60903
R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers, F.C.; Van de Ven, W.J.
J. Biol. Chem. 268, 13439-13447, 1993
A;Title: Cloning and expression of alternative transcripts of a novel neuroendocrine-specific gene and identification of its 135-kDa translational product.
A;Reference number: A46583; MUID:93293865; PMID:7685762
A;Accession: A46583
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-776 <ROE1>
A;Cross-references: GB:L10333; NID:g307306; PIDN:AAA59950.1; PID:g307307
A;Accession: I60903
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 421-776 <ROE2>
A;Cross-references: GB:L10334; NID:g307308; PIDN:AAA59951.1; PID:g307309
C;Genetics:
A;Gene: GDB:RTN1; NSP
A;Cross-references: GDB:203968; OMIM:600865
A;Map position: 14q21-14q22

Query Match 13.5%; Score 789.5; DB 2; Length 776; 

Best Local Similarity 31.2%; Pred. No. 3.5e-25; 

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28; 

Qy 487 TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGT 546 

: I : I I I : : : : I : I II : I : I : 
Db 65 S GPARQ S P — VAMET AS T GVAGVS S AMDHT FS TT S KDGEG SCYTSLI S 110 

Qy 547 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI— VMEAPLNS- 599 

I I : I : : | | : : | I I MINI:: II 
Db 111 DICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGIESRGLFSSDSG 167 

Qy 600 — LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 

: I : I : II: : I : : I : : : I I : 

Db 168 IEMTP.AESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 227 

Qy 642 NVALKALGTKEGI KEPE SFNAAVQETEAPYISIACDLIKETKLSTE-PSP 690 

: : : : I I I : : I I : : : I I I I I I I : I : : : : I 

Db 228 DISIK PEGVREPDKPAPVEGKIIKDHLLEESTFAPYID DLSEEQRRAPQITTP 280 

Qy 691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSIPEV PQTQEEAVMLMK 745 

: I I I :: : | | : : | | | :: | | | : : 

Db 281 VKITLTEIE PSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP 334 

Qy 746 ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 792

I I I I : I I : | :: | : | | : | | : : 

Db 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKARSGPPT 394 

Qy 793 IPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKI 827 

II: II:: : I I : : I I : 

Db * 395 IPSPLDHEAS SAESGDSEIELVSEDPM — AAE DAL P S G YVS FGHVGG P P P S PAS P S I Q Y S 452 

Qy 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI -ANIQSG 879 

: I I 11-1:11. I I : : I I I : : : : I I : 

Db 453 ILREEREAELDS ELIIESCDASSASEESPKREQDSPPMKPSALDAI REETGVRAEER 509 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

II II II : II I I I I II I : I I 
Db 510 APSRRGLAEPG — SFLD-YPSTEPQPGPEL PPGDGALEPETPM 549 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSVVDLLYWRD 984

II I : : I : : : hi : I II I I I I 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 IKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1044

I I : I I : I I I : I I ! I I I I I : I I I I : I I I II I I I I I I I I I : I I : I I : I I I I I I 
Db 598 IKQTGIVFGSFLLLLFSLTQFSWSVVAYLALAALSATISFRIYKSVLQAVQKTDEGHPF 657 



Qy 1045 RAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

Mill I : : I : I : I I I : : : I I I I : II I I I I II I I I I I I I II I I I I I : I I I I I 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYWSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I I I I : : I : : I : I : : I I : I : I I III I I I I : : I I I I I I I I I II I : 

Db 718 LFNGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKIPGAKRHAE 776 
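
The three summary lines that precede each alignment ("Query Match ...", "Best Local Similarity ...", "Matches ...") share a simple "name value;" layout, so they can be pulled into a dictionary for tabulating hits. The following is a minimal sketch under that assumption only; the regular expression and the function name `parse_summary` are illustrative and are not part of the search program that produced this listing.

```python
import re

# Illustrative pattern (an assumption about the layout seen in this listing):
# a capitalised field name, whitespace, a number with optional '%', then ';'.
FIELD_RE = re.compile(r"([A-Z][A-Za-z. ]*?)\s+([-+0-9.eE]+)%?;")


def parse_summary(lines):
    """Collect 'name value;' pairs from the summary lines of one result."""
    fields = {}
    for line in lines:
        for name, value in FIELD_RE.findall(line):
            fields[name.strip()] = float(value)
    return fields


# Summary lines copied from RESULT 1 of this listing.
summary = parse_summary([
    "Query Match 13.5%; Score 789.5; DB 2; Length 776;",
    "Best Local Similarity 31.2%; Pred. No. 3.5e-25;",
    "Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28;",
])
```

All values are returned as floats, including counts such as `Matches`; cast them to `int` where integer counts are needed.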

RESULT 2 
I60904

neuroendocrine-specific protein C - human
C;Species: Homo sapiens (man)
C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999
C;Accession: I60904
R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers,
F.C.; Van de Ven, W.J.
J. Biol. Chem. 268, 13439-13447, 1993
A;Title: Cloning and expression of alternative transcripts of a novel
neuroendocrine-specific gene and identification of its 135-kDa translational
product.
A;Reference number: A46583; MUID:93293865; PMID:7685762
A;Accession: I60904
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-208 <RES>
A;Cross-references: GB:L10335; NID:g307310; PIDN:AAA59952.1; PID:g307311
C;Genetics:
A;Gene: GDB:RTN1; NSP
A;Cross-references: GDB:203968; OMIM:600865
A;Map position: 14q21-14q22

Query Match 11.7%; Score 685; DB 2; Length 208; 

Best Local Similarity 67.0%; Pred. No. 1e-21;

Matches 128; Conservative 32; Mismatches 31; Indels 0; Gaps 0; 

Qy 973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032

I : : I I I I I I I II : I I : I I I : I II III I I : I I I I I : I I I II I I I I I I I I I : 
Db 18 KSQAI DLLYWRDI KQTGI VFGS FLLLLFSLTQFS WSWAYLALAALSATI S FRI YKSVL 77 

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

I I : I I : I I I I I I : I I I I I : : I : I : I I I : : : I I I I : I I I I I I I I I I I I I I I I I 

Db 78 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 137 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152

Mill: I I I I I I I I I I I II : : I : : I : I : : I I : I : I I III I I I I : : I I I I 

Db 138 AVIM^LLTWGALFNGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQ 197 

Qy 1153 AKIPGLKRKAD 1163

I I I I I I I I :
Db 198 AKIPGAKRHAE 208
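
The "Best Local Similarity" figures reported in this listing are numerically consistent with the share of identical positions among all aligned columns, that is, Matches divided by the sum of Matches, Conservative, Mismatches and Indels. This is an observation drawn from the numbers printed here, not a documented definition of the search program. A minimal sketch of the check follows; the function name is an assumption.

```python
def local_similarity(matches, conservative, mismatches, indels):
    """Percent of identical positions among all aligned columns.

    Assumed formula, inferred from the counts printed in this listing.
    """
    total = matches + conservative + mismatches + indels
    return round(100.0 * matches / total, 1)


# Counts copied from RESULT 1 and RESULT 2 of this listing.
result_1 = local_similarity(243, 115, 252, 169)
result_2 = local_similarity(128, 32, 31, 0)
```

Recomputing the percentage this way is a quick test for OCR damage in the count lines: a digit misread in any of the four counts usually moves the result away from the printed percentage.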



RESULT 3 
A60021 

tropomyosin-related protein, neuronal - rat
C;Species: Rattus norvegicus (Norway rat)
C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999
C;Accession: A60021
R;Wieczorek, D.F.; Hughes, S.R.
Brain Res. Mol. Brain Res. 10, 33-41, 1991
A;Title: Developmentally regulated cDNA expressed exclusively in neural tissue.
A;Reference number: A60021; MUID:91278684; PMID:1647480
A;Accession: A60021
A;Molecule type: mRNA
A;Residues: 1-267 <WIE>
A;Cross-references: EMBL:X52817; NID:g456549; PIDN:CAA37001.1; PID:g456550
C;Comment: This neuronal-specific mRNA was identified by hybridization to an
alpha-tropomyosin probe but does not show homology in amino acid sequence.

Query Match 11.5%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 5.6e-21; 

Matches 124; Conservative 33; Mismatches 30; Indels 0; Gaps 0; 

Qy 973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032

I : : I I I I I I I I I : I I : I I I : I II III I I : I II I I : I I I II I II I I I II I : 
Db 9 KSQAI DLLYWRDI KQTGI VFGS FLLLLFS LTQFS WSWAYLALAALSATI S FRI YKSVL 68 



Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

' I I : I I : I I I I II : I I I I I : : I : I : I I I :: : I I I I : I I I I I I I I I MINIM 

Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152

Mill: I I I I I I I I I I I I I : : I : : I : I : : I I : I Ml Ml I I I I : Mill 

Db 129 AVLMWLLTWGALFNGLTLLLMAWSMFTLPWWKHQAQVDQYLGLVRTHINTWAKIQ 188



Qy 1153 AKIPGLK 1159 

I I I I I : 
Db 189 AKIPGAR 195 
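
Each Qy and Db line carries a start and an end coordinate, so the number of residue letters on the line should equal end minus start plus one, with gap characters and stray spaces not counted. Comparing the two gives a simple way to spot lines where the text extraction dropped or merged characters. A minimal sketch, with hypothetical helper names:

```python
def residue_count(seq):
    """Count residue letters, ignoring gaps, spaces and other symbols."""
    return sum(1 for ch in seq if ch.isalpha())


def span_is_consistent(start, seq, end):
    """True when the residue count matches the printed coordinates."""
    return residue_count(seq) == end - start + 1


# Short line from RESULT 3 of this listing: residues 1153 to 1159.
ok_line = span_is_consistent(1153, "AKIPGLK", 1159)

# Hypothetical damaged line: five positions claimed, four letters present,
# as happens when two adjacent letters are merged into one by OCR.
damaged_line = span_is_consistent(10, "GWFG", 14)
```

The check only detects a length mismatch; it cannot say which letter was lost, so any repair still needs a second copy of the same sequence to compare against.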



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T26216
R;Ainscough, R.
submitted to the EMBL Data Library, August 1996
A;Reference number: Z20173
A;Accession: T26216
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-2484 <WIL>
A;Cross-references: EMBL:Z78066; PIDN:CAB51467.1; GSPDB:GN00023; CESP:W06A7.3c
A;Experimental source: clone W06A7
C;Genetics:
A;Gene: CESP:W06A7.3c
A;Map position: 5
A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2



Query Match 8.8%; Score 517; DB 2; Length 2484; 

Best Local Similarity 20.1%; Pred. No. 2.7e-13; 

Matches 262; Conservative 189; Mismatches 413; Indels 440; Gaps 51; 



Qy 28 FVTEPEDEE DEEEEEDEEED DEDLEELEVL 57 

I I I : I I :| III) I I I : I : I 

Db 1437 FGTESSEESQKADGNQENQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDV 1496 

Qy 58 ERKP AAGLS AAAVP PAAAAP L-LDFSSD S VP PAP RG P L P AAP PAAP — ERQPSW 108 

I : : I I I : : I I I I :: I I I : I II I I 

Db 14 97 DASDVNEQDEESTLKILKWPSEPSLLELDFTND — PKVIHVPIPLMEPATMYLEEMVEW 1554 

Qy 109 ERSPAAPAPSLPPAAAVLPS KLP-EDDEPPARPPPPPPA 146 

I : I I : : : | I I I I I I I III 

Db 1555 IIADAVKEVSEMEVVTESEISEiytAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPWPG 1614 

Qy 147 GASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPS-SAEKIMDLMEQPGNTV 205 

I I I : I I M :: I : I I : I : I I 

Db 1615 QVQERIIPIEVEQAPTIPQRPPRAPKSE LPKVAKPLDDSKSRVRFAPLNIKLGRTY 1670 

Qy 206 SSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELP 265 

II: III II::: :: I 

Db 1671 SEEQQK ELVESLE — RPLTIIT : QQKPP 1695 

Qy 266 ERAT NPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVIVRSKDKE 319 

I : I : I I I I : I : : I I 

Db 1696 EKPTEDIGALSPLSPNTLAEYEEVPMMDMQS 1726 

Qy 320 DLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQ-MSWAPVREEYADFKPFEQAWEV 378 

I I I I I I : : : I : I I I I I I I I I I 

Db 1727 VPHSPQE KQEEI EALSEIIEEPQAMKEVE KPVESAPE- 1763 

Qy 379 KDTYEGSRDVLAARANVESKVDRKCLEDSL — EQKSLGKDSEGRNEDASFPSTPEPVKDS 436 

II : I I : : I : I : III r I I : I 

Db 1764 KD NESLEAPEIINEPIRRVLVETKIMGPGKSLNED NDD 1801 

Qy 437 SRAYITCAS FTSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFL 496 

: I II : I I II 

Db 1802 DDDGSECLDSIGDLSER . TIQRFN 1824 

Qy 497 VAVQDSEADYVTTDTLSKVT EAAVSNMPEGLTPDLVQEACESELNEATGTKI 548

: : I : | : | : : I : I : : I I 

Db 1825 TSIDDPS 1 RRDS FS S I S S FGDRQKFRTAI ENI RQDLLP 1862 

Qy 549 AYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASV 608 

: : : I I : : : I I : I I : : : : I : : I : I I : I 

Db 1863 -FQSSV SQYLRSSPNPSQQL LVTNLSMDSP-SDLSPNA 1898

Qy 609 VQPSVSPLEAPPPVSYDSI KLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAA 663 

III::: I I : I : I I : : : I : : : : 

Db 18 99 PPVGFENTAQFLEKLQQEDRPSAEGS I D SSGFEKVD— HEG 1937 

Qy 664 VQETEAPYISIACDLIKETKLSTEPSPDF SNYSEIAKFEKSVPE 707 

: I I I : I : : : : : I I : I : : I : I 

Db 1938 LDEFAAPPVH DPMQKSVFGSLGSDDMKPGSQDDGFVFIERNEANEATLKKNQKMSSH 1994 

Qy 708 HAELVEDS SPESEPVDLFSDDSIPEVPQTQEEAV MLMKESLTE 750 

I : : : I : I I : I I : : : : : : I I : : I I 

Db 1995 HNDVIEKNYFNDNAPTAALLESPIAEEARKLVQDAVESASEYKKQAVDSGDEIGRELLDN 2054 



Qy 751 VSETVAQHKE ERLSASPQELGKPYLESFQP 780 

I : : I I I I I : : I : : I 

Db 2055 VEQKIEQVKEPIVDSLHKAYDGVGDFVHETVPNAVDDFVREAEKQLPESPVPEKIETPEP 2114 

Qy 781 — NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDK IKES 830 

: : I I I : : : : : I I : I I : : I I : : 

Db 2115 LVDI HDTVDKVHDEVDN FLRRE P TPPFETDDVAPLSDDKPQFGNQTPEEDE 2165 

Qy 831 ETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGA DSLPCL 886 

II I : I : I : I : :': : : I I : II : I I 

Db 2166 TTFDRKGPLTI PEEVEKAAAAQNND — - L D D FD P L VT S NT GAAFGAAVGAAAAVE S LT E E 2222 

Qy 887 ELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKASISPSNVSA LEPQTEM 935 

I : I : : III : I I I : I I I : I : 

Db 2223 EMFGHQKFETVPRP PTP PKD ISDE-r DVKPSTVNLGPSHHHSHPSSPH 2268 

Qy 936 GSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGAS 995

I I : I I : : I : I : : I I I I I I : : I : 

Db 2269 HSILKHHG DAW I D FKT VP P CVLDVI YWRDAKKS AI VL S LA 2308 

Qy 996 LFLLLS LTVFS I VSVTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGHP FRAYLESEVAI S 1055 

I : I I : : : : I I I I I I : : I I I : I : I III I : : : 

Db 2309 LLVLFVLAKYPLLTVWYSLLIALGAAAGFRVFKKVEAQIKKTDSEHPFSEILAQDLTLP 2368 

Qy 1056 EELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 1115

Ml : : : I : I : : I I : : : I : I I : : : I I I : : I : I I I I I 

Db 2369 QEKVHAQADVFVEHATCIv^NKLKKLVFVESPLESIKFGLVLWSLTYIASWFSGFTLAILG 2428 

Qy 1116 LISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLK 1159 

I : : I I : I : I I : I I I : I : : I : I I : I I :. 

Db 2429 LLGVFSVPKVYESNQEAIDPHLATISGHLKMVQNIIDEKLPFLR 2472 



RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T26215
R;Ainscough, R.
submitted to the EMBL Data Library, August 1996
A;Reference number: Z20173
A;Accession: T26215
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-2607 <WIL>
A;Cross-references: EMBL:Z78066; PIDN:CAB01522.2; GSPDB:GN00023; CESP:W06A7.3a
A;Experimental source: clone W06A7
C;Genetics:
A;Gene: CESP:W06A7.3a
A;Map position: 5
A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2;
2586/2



Query Match 8.6%; Score 503.5; DB 2; Length 2607; 

Best Local Similarity 20.8%; Pred. No. 1e-12;

Matches 280; Conservative 181; Mismatches 481; Indels 407; Gaps 52; 



Qy 28 FVTEPEDEE DEEEEEDEEED DEDLEELEVL 57 

I I I : I I : I M I I | | | : | : I 

Db 1437 FGTESSEESQKADGNQENQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDV 1496 

Qy 58 ERKP AAGL S AAAVP P AAAAP L- LD FS S D S VP PAP RG P L P AAP PAAP — ERQPSW 108 

I : : I I I : : I I I I : : I I I : I II I I 

Db 1497 DASDVNEQDEESTLKILKWPSEPSLLELDFTND — P KVI HVP I P LME PATM YLEEMVEW 1554 

Qy 109 ERSPAAPAPSLPPAAAVLPS KLP-EDDEPPARPPPPPPA 146 

I : | I : : : | I I I I I I I III 

Db 1555 IIADAVKEVSEMEVWESEISEMAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPWPG 1614 

Qy 147 GAS P LAE PAAP P S T PAAP KRRGS G S VDET L FAL P AAS E P VI PS- S AE K IMD LMEQ P GNT V 205 

I I I : I I | | : : | : I I : I : I I 

Db 1615 QVQERIIPIEVEQAPTIPQRPPRAPKSE -LPKVAKPLDDSKSRVRFAPLNIKLGRTY 1670 

Qy 206 SSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELP 265 

II: I II II::: : : I 

Db 1671 SEEQQK ELVES LE — RPLTIIT QQKPP 1695 

Qy 266 ERAT NPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVIVRSKDKE 319 

1:1 : I I I I : I : : I I II : : I I : I : I 

Db 1696 EKPTEDIGALSPLSPNTLAEYEEVPMMDM-QSVPHSP QEKQEEIEALSEIIE 1746 

Qy 320 DLVCSAALHS PQES- PVGKEDRWS PEKTMDI FNEMQMSWAPVREEYADFK PFEQA 375 

: : I I I I : : : I I III hi : I I : 

Db 1747 EPQAMKEVEKPVESAPEKDNESLEAPE IINE PIRRVLVETKIMGPGKSL 1795 

Qy 376 WEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDS E 418 

I I : : I : :: : :: |:: |: :|| I 

Db 1796 NEDNDDDDDGSECLDSIGDLSERTIQR- FNTSIDDPSIRRDSFSSISSFGDRQKFRTAIE 1854 

Qy 419 GRNED ASFPSTPEPVKDSSRAYITCAS FTSATESTT ANTFPLLE 462 

: I 1:11 | : : | | I : : : I I I I 

Db 1855 NIRQDLLPFQSSVSQYLRSSPNP SQQLLVTNLSMDSPSDLSPNAPPVGFENTAQFLE 1911 

Qy 463 DHTSENKTDEKKIEERKAQII TEKTSPKTSNPF 495

I : | I : :: I I : :: | : : 

Db 1912 KLQQEDRPSAEGSIDSSGFEKVDHEGLDEFAAPPVHDPMQKSVFGSLGSDDMKPGSQDDG 1971 

Qy 4 96 LVAVQD S EAD YVT TDTLSK VTEAAVSNMPEGLTPDLVQ 533 

. I : : : I I : I I : I : I : : : 'III: 

Db 1972 FVFI ERNEANEATLKKNQKMS SHHNDVI EKNYFNDNAPTAALLES PI AEEARKLVQDAVE 2031 

Qy 534 EACE -SELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVL 588

II : : | :: : : | | | : | | : | I I : I 

Db 2032 SASEYKKQAVDSGDEIGRELLDNVEQKIEQVKEPIVDSLHKAYDGVGDFVH-ETVPNAV- 2089 

Qy 589 PDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKAL 648 

I I I I I I I : I I 
Db 2090 DDFVREAE KQLPESPVP 2106 

Qy 649 GTKEGIKEPESFNAAVQETEAPYISIACDLIK ETKLSTEPSPDFSNYSEIAKFEK 703 

11:11 I : I : I : I I I : I I 
Db 2107 EKIETPE PLVDI HDTVDKVHDEVDN FLRREPT P P FE 2142 



Qy 704 SVPEHAELVEDSSPESEPVDLFSDDS I PEVPQTQEEAVMLMKESLT EVSETVAQHKE 760

: I : I I : I : : II ::| I I I II : I

Db 2143 TDDVAPLSDDKPQFGNQT-PE EDETTFDRKGPLTI PEEVEKAAAAQNN 2189



Qy 761 ERLSAS PQELGKP YLES FQPNLHSTKDA- ASNDI PTLTKKEKI SLQMEEFNTAI 813 

: | : | | : | | | : : : I I :: I I : I I 

Db 2190 D L D D FD P LVT S NT GAAFGAAVGAAAAVE S LT EE EMFGH Q — KFETVP 2234 

Qy 814 YSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSP — KLAKEYTDLEVSDKS 871 

:: :| I :: |: : II : I : I : |: 

Db 2235 RPPTPPKDISDEDVKPS TVNLGPSHHHSHPSSPHHSILKHHGDAWIDFKT 2284 

Qy 872 EIANIQSG ADSLPCLE LPCDLSFKNI YPKDEVH 904 

I : : I I I I : I I I I : I 

Db 2285 VPPCAQNAFSPGEIMFLLAFFVYLSCFASFFSKSLPLLDNLLSLWYLSISLI IH 2339 

Qy 905 VSDE FSENRSSVSKASISPSNVSA LEPQTEMGSIVKSKSLTKEAEK 950 

I . | :::| | : | I : :| III 

Db 2340 VKH H RK FRWN E EQATTMS KL GAVGRGL YAL I AF I VN I VL RVGLNVALWGVAVS AH EAYK 2399 

Qy 951 KLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSV 1010

I I II | : I : : I I M I I : : I : I : I I : : : : I 

Db 2400 LTKS SGVLRKKEVLDVIYWRDAKKSAIVLSLAiLVXFVTjAKYPLLTV 2446 

Qy 1011 TAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHV 1070 

I I I I I : : I I I : I : I III I : : : : I I : : : I 

Db 2447 WYSLLLALGAAAGFRVFKKVEAQIKKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHA 2506 

Qy 1071 NSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQ 1130

: I : : I I : : : I : I I : : : I I I : : I : I I I I I I : : I I : I : M : I 
Db 2507 TCIANKLKKLVFVESPLESIKFGLVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQ 2566 

Qy 1131 VQIDHYLGLANKSVKDAMAKIQAKIPGLK 1159 

I I : I : : I : I hi h 

Db 2567 EAIDPHLATISGHLKNVQNIIDEKLPFLR 2595 



RESULT 6 
T13564 

microtubule-associated protein homolog - fruit fly (Drosophila melanogaster)
N;Alternate names: hypothetical protein EG:49E4.1
C;Species: Drosophila melanogaster
C;Date: 13-Aug-1999 #sequence_revision 13-Aug-1999 #text_change 17-Nov-2000
C;Accession: T13564
R;Spanos, L.; Papagiannakis, G.; Siden-Kiamos, I.; Louis, C.
submitted to the EMBL Data Library, April 1999
A;Description: Sequencing the distal X chromosome of Drosophila melanogaster.
A;Reference number: Z17689
A;Accession: T13564
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-5327 <SPA>
A;Cross-references: EMBL:AL031128; PIDN:CAA20006.1
C;Genetics:
A;Cross-references: FlyBase:FBgn0025392
A;Introns: 24/2; 52/3; 104/3; 179/1; 232/1; 1669/3; 2566/1; 4798/3; 5272/1
A;Note: EG:49E4.1
C;Superfamily: Drosophila 576K microtubule-associated protein homolog



Query Match 5.6%; Score 328.5; DB 2; Length 5327; 

Best Local Similarity 22.7%; Pred. No. 4.1e-05; 

Matches 255; Conservative 176; Mismatches 453; Indels 238; Gaps 54;

Qy 2 EDIDQSSLVSSSTDSPPRPPPAFKYQFVTEP EDEEDEEEEEDEEEDDEDLEELEVL 57 

:: I : I I : I I I : I I : I I I : I I I : I I 

Db 3196 DEADKSKEESRRESGAEKSPLASKE — ASRPASVAESIKDEAEKSKEESRRESVAEKSPL 3253 

Qy 58 ERKPAAGLSAAA — VPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAP 102 

I I : : : I I I : I I I I I : I I : : I 

Db 3254 PSKEASRPTSVAKSVKDEAEKSKEESSRDSV — AEKSPLASKEASRPASVAESVQDEAEK 3311 

Qy 103 ERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLA-EPAA 156 

I : hill : I I : I I I : : I II I I : I : 

Db 3312 SKEESRRESVAEKSPLAYKEASRP-ASVAESIKDEAEKSKEESRRESVAEKSPLASKEAS 3370 

Qy 157 PPSTPA APKRRGSGSVDETLFALPAAS-EPVIPSS-AEKIMDLMEQPGNTVSSG 208

I : : I II: II I I I I | : I I I : I I : 

Db 3371 RPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRE 3430 

Qy 209 QEDFPSVLLETAASLPSLSPLSTVSFKEHG YLGNLSAVSSSEGTIEETLNEAS 261

II III: I I : I :: I I : :: I : 

Db 3431 SVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAESV 3490 

Qy 262 KELPERATNPFVNRDLAEFSELEYSEMG— SSFKGSPKGESAILVENTKEEVIVRSKDKE 319 

I : I : : Mi ll I : I III: I : I I I : I 

Db 3491 KDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEA EKSKEE SRRE 3541 

Qy 320 DLVCSAALHSPQES-PVGKEDRW-SPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWE 377 

: : : I I : I I : I I I : : : : : | : : | : I I 

Db 3542 SVAEKSSLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLASKEAS-RPASVAES 3600 

Qy 378 VKDTYEGSRDV LAARANVESKVDRK — CLEDSLEQKSLGKDSEGRNE 422 

III I I : : I : | : : : M : : : | : : : : I I I 

Db 3601 VKDEAEKSKEVSRRESVAEKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPL 3660 

Qy 423 DASFP-STPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTS ENKTD- 471 

: I I I I I I I I : : II I : II I hi 

Db 3661 ASMEASRPTSVAESVKDETEK SKEESRRESVTEKS-PLPSKEASRPTSVAESVKDE 3715 

Qy 472 -EKKIEERKAQIITEKT SPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG- 526 

II | | : : : I I : I :: I I I h : : I : : I I I 

Db 3716 AEKSKEESRRESVAEKSPLASKESSRP ASVAESIKDEAEGTKQESRRESMPESG 3769 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYP 569 

I I I I : : I : I : I I : h : I 

Db 3770 KAESIKGDQSSLASKETSRPDSWESVKDETEKPEGSAI DKSQVAS RP ES VAVSAKD 3826 

Qy 570 T AQLC P S FEEAEAT P S PVL P D I VMEAP LN S LL P S AGAS WQ P S VS P LEAP P P VS YD S I KL 629 

I h I I I f I I I I I : : I h I I I 
Db 3827 EKSPLHSRPESVADKS PDASKEA SRSLSVAETASSPIEEGPRSIAD 3872 

Qy 630 EPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPS 689 

::: I . I :| I I : :: : I :: I I : I 

Db 3873 LSLPLNLTGEAKG-KLP-TLSSPIDVAEGDFL EVKAESSPR 3911 



Qy 690 PDFSNYSEIAKFEKSVPEH — AELVEDSSPESEPVDLF SDDSIPEV 733 

I : I : I : I : | : |:::|| | ::: :: : :: 

Db 3912 P — AVLS KPAEFS Q PDTGHTAST P VDEAS PVLEE I EWEQHTT S GVGATGATAETDLLDL 3969 

Qy 734 PQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASNDI 793 

: I : I I I : I I : I I : I : : : I I I : : : : : : 

Db 3970 TETKSETV — TKQSETTLFETLTSKVESKVEVLESSV- KQVEEKVQTSVKQAETTVTDSL 4026 

Qy 794 PTLTKKEKISLQMEE FNTAIYSNDDLLSSKEDKIKESETFSDSSP 838

I I I I I I : I I : I I : : : : : I : I I 
Db 4027 EQLTKKS — SEQLTEIKSVLDTNISNVTNLFSTAVETIEKKVQDVTEKVIEKAT 4078 

Qy 839 I EI I DEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIAN I QS GADS LPCLELPCDLSFKNIY 898 

I : I I : : I : : : I : I : : I I : I : I 
Db 4079 -EHVSEHVT— TTGESSTETSQEKSSLDLGTFSELRETHITTVGSPEFTV TIC 4128 

Qy 899 PKDE--VH-VSDEFSENR SSVSKAS-ISPSNVSALEPQTE MGSIVKS- 941 

: I I : I : : I I : I I I I I : I I : I I : I : : : I I 

Db 4129 ERDEPVLHDIKEEDEEHRFSPPSDVDKAAIIPPQPMRPLSPREEEVAKIVADVAKVLKSD 4188 

Qy 942 KSLT KEAEKKLPS — DTEKE-DRSLSAVLSAELS 972 

I : I : : | : | | | I I I : I I : I I I : I 

Db 4189 KDITDII PDFDERQLEEKLKSTADTEEESDKSTRDEKSLEI S 4230 



RESULT 7 
I38346

elastic titin - human (fragment)
C;Species: Homo sapiens (man)
C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 21-Jul-2000
C;Accession: I38346
R;Labeit, S.; Kolmerer, B.
Science 270, 293-296, 1995
A;Title: Titins: giant proteins in charge of muscle ultrastructure and
elasticity.
A;Reference number: A57430; MUID:96026330; PMID:7569978
A;Accession: I38346
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-7962 <RES>
A;Cross-references: EMBL:X90569; NID:g1017426; PIDN:CAA62189.1; PID:g1017427
C;Genetics:
A;Gene: GDB:TTN
A;Cross-references: GDB:127867; OMIM:188840
A;Map position: 2q31-2q31

Query Match 5.5%; Score 322; DB 2; Length 7962; 

Best Local Similarity 21.7%; Pred. No. 0.00014; 

Matches 259; Conservative 146; Mismatches 416; Indels 370; Gaps 58; 

Qy 23 AFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVL ERKPAAGL 65 

I I : : I I I : I I I I I I : : I : I I I : I : I I 

Db 6574 AFEEEVVTHVEEYLVEEEEEYIHEEEEFITEEEVVPVT PVKVPEVPRKPVPEEKKPVPVP 6633 

Qy 66 SAAAVPPAAA -APLLDFSSDSVPPA PRGPLP AAP 98

I I I I : I : III I : I : I I I



Db 6634 KKKEAPPAKVPEVPKKPEEKVPVLIPKKEKPPPAKVPEVPKKPVPEEKVPVPVPKKVEAP 6693 

Qy 99 PA-APE— RQPSWERSPAAPAP SLPPA-AAVLPSKL-PEDDEP PAR PPPPP 144 

II II : : I I : III III : | I I I I : : i I : III 

Db 6694 PAKVPEVPKKPVPEKKVPVPAPKKVEAPPAKVPEVPKKLIPEEKKPTPVPKKVEAPPPKV 6753 

Qy 145 P AGAS P LAE P AAP P S T P AAP KRRG S G S VD ET L FAL P AAS E P VI P S S AE K IMD LMEQ P GNT 204 

I hill : I I I I : : I I : : : I : 
Db 6754 PKKREPVPVPVALPQE EEVLF EEEIVPE — EEVLPEEEE 6790 

Qy 205 VSSGQEDFPSVLLETAASLP SLSPLSTVSFKEHGYLGNLSAVSSSEGTI — EETLNE 259 

I : I : III II : I II I I : II I I 

Db 6791 VLPEEEE VLPEEEEVLPEEEEIPPEEEEVPPEEEY VPEEEEFVPEEEVLPE 6841 

Qy 260 ASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESA 1 LVENTKEEVI V 313 

: : I I I : : : I : I I I I : : : | : | : 

Db 6842 VKPKVPVPAPVPEIKKKVTEKKW 1 PKKEEAPPAKVPEVPKKVEEKRI I 6890 

Qy 314 RSKDKEDLVCSAALH S PQES PVGKED RWSPEKTMDIFNEM 354 

I : : I I | : | | : : | : | : | : : : : 

Db 6891 LPKEEEVLPVEVT-EEPEEEPISEEEIPEEPPSIEEVEEVAPPRVPEVIKKAVPEAPTPV 6949 

Qy 355 - QMS WAP VREE YAD FK P F EQ AW E VK D T Y E G S RD VLAARAN VE S K 398 

: I I I::: I I : I I : :: : | 

Db 6950 PKKVEAPPAKVSKKIPEEKVPVPVQKKEA PPAKVPEVPKKVPEKKVLVPKKEAVPPA 7006 

Qy 399 VDRKCLEDS LEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCA 444 

III: | | : : : | | : | | I 
Db 7007 KGRTVLEEKVSVAFRQEWVKERLELEWEAEVEEIPEEEEFHEVEE YFEEG 7058 

Qy 445 SFTSATESTTANTFPLLEDHTSEN KTDEKKIEER KAQIITEKTSPKT 491 

I I : I : I I : : | : : | : | I : I I I 

Db 7059 EFHEVEEFI KLEQHRVEEEHRVEKVHRVI EVFEAEEVEVFEKPKAPPKGPEI SEKI I PPK 7118 

Qy 492 SNPFLVAVQDSEADYVTTDTLSKVTEAAV SNMPEGLTPDLV — 532 

II: I III : : | | | | | 

Db 7119 KPPTKWPRKEPPAKVPEVPKKIWEEKVRVPEEPRVPPTKVPEVLPPKEWPEKKVPVP 7178 

Qy 533 QEACESELNEATGTKIAYETKVDL VQTSEAIQESLYP 569 

II :: I I :: I I I : : : I :: I 

Db 7179 P AKK P EAP P P KVP EAP - KEWP E KKVP VP P P KK P EVP P T KVP EVP KAAVP E K KVP EAI P P 7237 

Qy 57 0 TAQLCPS — FEEAEATPS PV-LPDIVMEAPLNSLLPSAGASWQPSVSPLE 617 

: I III I :M II :|:: I :|:| :| |:|:: 

Db 7238 KPESPPPEVFEEPEESPSAPPKKPEVPPVRVPEVPKEWPEKKVPAAPPK — KPEVTPVK 72 95 

Qy 618 AP PPVSY DSIKLEPENPPP — YEE 639 

I II : : | : | | : | | | : I I 

Db 7296 VPEAPKEWPEKKVPVPPPKKPEVPPTKVPEVPKVAVPEKKVPEAIPPKPESPPPEVFEE 7355 

Qy 640 AMNVALKALGTKEGI KEPESFNAA — - VQETEAPYISIACDLIKETKLSTEP 688 

III: I : : I I I II I I : I I : : I : : I 

Db 7356 PEEVALEE-PPAEWEEPEP — AAPPQVTVPPKNPVPEKKAPAV VAKKPELPPVK 74 07 

Qy 689 SPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESL 748 

I : I : I I I 1 I I I I : I : I M I II : 

Db 7408 VPEVP-- KEWP-EKKVP LWPKKPEAPPA KVPEVP KEW 7444 



Qy 749 TEVS ETVAQHKEERLSAS PQELGKP YLES FQPNLHSTKDAASNDI PTLTKKEKI S LQMEE 808 

I I : I : I : I I I I : I : : I I : I : I : : I 

Db 7445 PEKKVAVPKKPEVPPAKVPEVPKKPVLEE-KPAVPVPERAESPPPEVYEEPEEIAPEEE- 7502 



Qy 809 FNTAIYSNDDLLSSKEDK IKESE TFSDSSPIEIIDEFPTFVSAKDDSPKLA 859 

: : : I : I : I I i : I I I I I : : I 

Db 7503 IAPEEEKPVPVAEEEEPEVPPPAVPEEPKKIIPEKKVPVIKKPEAPP-P 7550 



Qy 860 KEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFK-NI YPKDEVHVSDEFSENRSSVSK 918 

II : I : I : : II : | : I I : I II I 

Db 7551 KEPEPEKVIEKPKLKPRPPPPPPAPPKEDVKEKIFQLKAIPKKKV PENPQVPEK 7604 



Qy 919 ASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLP — SDTEKEDRSLSAVL 967 

I I I 1:111 I I : I : I I 

Db 7605 VELTPLKVPGGE KKVRKLLPERKPEPKEEWLKSVL 7640



RESULT 8 

T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T26213
R;Ainscough, R.
submitted to the EMBL Data Library, August 1996
A;Reference number: Z20173
A;Accession: T26213
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-222 <WIL>
A;Cross-references: EMBL:Z78066; PIDN:CAB01523.1; GSPDB:GN00023; CESP:W06A7.3b
A;Experimental source: clone W06A7
C;Genetics:
A;Gene: CESP:W06A7.3b
A;Map position: 5
A;Introns: 27/1; 77/2; 201/2



Query Match 5.5%; Score 320; DB 2; Length 222; 

Best Local Similarity 32.1%; Pred. No. 1e-06;

Matches 59; Conservative 47; Mismatches 78; Indels 0; Gaps 0; 

Qy 976 VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 1035

: : I : : I I I I I I : : I : I : I I : : : : I I I I I I : : I I I 

Db 27 I L D VI YW RDAK K S AI VL S LAL L VL FVLAK Y P L L TWT Y S L L LAL GAAAG F RVF K KVEAQ I 86 

Qy 1036 QKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVL 1095 

: I : I III I : : : : I I : : : I : I : : I I : : : I : I I : : 

Db 87 KKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFGLV 146

Qy 1096 MWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI 1155 

: I I I : : I : I I I I I I : : I I : I : I I : I I I : I : : I : I . I : 

Db 147 LWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKNVQNIIDEKL 206 



Qy 1156 PGLK 1159 

I I : 

Db 207 PFLR 210 



RESULT 9 
A47283 

calphotin - fruit fly (Drosophila melanogaster)
C;Species: Drosophila melanogaster
C;Date: 21-Sep-1993 #sequence_revision 25-Apr-1997 #text_change 21-Jul-2000
C;Accession: A47283
R;Ballinger, D.G.; Xue, N.; Harshman, K.D.
Proc. Natl. Acad. Sci. U.S.A. 90, 1536-1540, 1993
A;Title: A Drosophila photoreceptor cell-specific protein, calphotin, binds
calcium and contains a leucine zipper.
A;Reference number: A47283; MUID:93165730; PMID:8434015
A;Contents: photoreceptor cells
A;Accession: A47283
A;Status: preliminary
A;Molecule type: nucleic acid
A;Residues: 1-873 <BAL>
A;Cross-references: GB:L05080; NID:g157071; PIDN:AAA28420.1; PID:g157072
A;Note: sequence extracted from NCBI backbone (NCBIN:124958, NCBIP:124959)
C;Superfamily: collagen alpha 1(I) chain; fibrillar collagen carboxyl-terminal
homology; von Willebrand factor type C repeat homology

Query Match 5.2%; Score 304.5; DB 2; Length 873; 

Best Local Similarity 21.9%; Pred. No. 3.1e-05; 

Matches 213; Conservative 126; Mismatches 362; Indels 273; Gaps 42; 

Qy 62 AAGLSAAAVPPAAAAPLLDFSSDSVPPAPR— GPLP7UVP PAAP ERQPSWER 110 

: I : : I I I I I I : I : I II I I I I I III I I : 

Db 11 SAPVAAPVTPSAVAAPVQWSPAAVAVAPAWAPAPAAPTAVTPVAPPPTLASVQPATVT 70 

Qy 111 SPAAPAP SLPPAAAVLPSKLPEDDEPPARPPPPPP AGASPLAEPAAPP 158 

I I I I I I : I I : I I : III II I :: I : I I I 

Db 71 VP-APAPIAAASVAPVASVAPPWAAPTPPAASPVSTPPVAVAQIPVAVSAPVAPPVAAT 129 

Qy 159 STPAAPKRRGSGSVDETLFALP — AASEP VIPSSAEKIMDLMEQPGNT VSSGQ 209 

I I I I I : I I I I I I I I : I III I : : 

Db 130 PTPVAPI PVAAPVIATPPVAASAPTPAAVTPWSPVIATPPWPANTTVPVAAPV 184 

Qy 210 EDFPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERA 268 

I: : I I I : : : I I :: : : II I : |: I 

Db 185 AAVPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEIPVA — TIPECVAPLIPEVSWA 242 

Qy 269 TNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVIVRSKDKEDLVCSAALH 328 

I I I I : I 11:1 

Db 243 TKPLA— AAE PVWAP PATET P WAP AAA 269 

Qy 329 S PQES PVGKEDRWS PEKTMDI FNEMQMS WAPVREEYADFKPFEQAWEVKDTYEGSRDV 388 

II I I : I : : : | | | | | 

Db 270 SPHVS VAP AVETAWAPV 287 

Qy 389 LAARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYITCA 444 

I : I I I : | : I : | : I I I I : : : 

Db 288 S AS T E P P VAAAT LT T AP ET P ALAP WAE S QVAANT WAT PPT PAP E P ET I AP P WAE 344 

Qy 445 SFTSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEA 504 

: I : : I I : : : I : I I I : I I I : 



Db 345 T P EVAS VAVAETT P PWP PVAAE S 1 PAP WAT T P VPAT LAVT D P D - 389



Qy 505 DYWTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQ 564 
Db 390 VTAS AVP E L P P VI AP S P VP S A VAETPVDLA P 420 

Qy 565 ESLYP-TAQLCPSF — EEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAP— 619 

II I : I : II 11:1 : : I I : : I : I I : I I I I 

Db 421 P VL P PVAAE P VP AWAEET PET PAP AS AP VT I - AALD — I P EVAP VI AAP S DAP AEAP S A 477 

Qy 620 — PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD 677 

III II |:|||||::: 

Db 478 AAPIVS TPPT TAS VP ETTAP PAAVPT E 504

Qy 678 LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE — LVEDSSPESEPVDLFSD-DS 729 

:: I : I : I : I : I : : I I I I :: I I : II I 

Db 505 P I DVS VL S EAAI ET P VAP P VEVT T EVAVADVAP P EAAE D L 1 1 E P VE P PAP I P DL L EQTT S 564 

Qy 730 IPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAA 789 

: I I : : : : I I : I I I :: : I : I I . : I : I : : 

Db 565 VPAVEAAESTSSPIPETSLPPPNEAVA SPEVAVAPITAPEPIPEP-EPSLATPTEPI 620 

Qy 790 SNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFV 849 

: I :: I :: I : : I I I I I I 

Db 621 PVEAPV VI Q EAVDAVEVP VT ET S T S I P ET T VE F P EAV 657 

Qy 850 SAKDDSPKLAKEYTDLEV-SDKSEIANIQSGAD SLPCLELPCDLSFKNIYPKDEVH 904 

: I I : | : | : : : : | M | | : I : I : I 
Db 658 AEKVLDPAI TEAPVTTQEPDVANINDGAPATEITTPAVEI VT 699

Qy 905 VSDEFSENRSSVSKASISPSNVSALE-PQTEM— GSIVKSKSLTKEAEKKLPSDTEKEDR 961 

: I I : : : I : : I I : I I : : : I I I : I : : : 

Db 700 AAAE VS DT AI P L I D P P V- P QE I AVAE I P ET ET K PAEVI VEQ S - T I P I EAP VP E VS K YAE P 757 

Qy 962 SLSAVLSAELSKTS 975 

: I : I I : I : 
Db 758 VISEAPAAEVPITA 771 



RESULT 10 
T24583 

hypothetical protein T06D8.1 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T24583
R;Palmer, S.
submitted to the EMBL Data Library, April 1995
A;Reference number: Z19909
A;Accession: T24583
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-1829 <WIL>
A;Cross-references: EMBL:Z49130; PIDN:CAA88964.1; GSPDB:GN00020; CESP:T06D8.1
A;Experimental source: clone T06D8
C;Genetics:
A;Gene: CESP:T06D8.1
A;Map position: 2
A;Introns: 1391/3; 1432/3; 1470/3; 1505/1; 1520/1; 1616/1; 1644/1; 1687/3;
1742/1



Query Match 5.2%; Score 302.5; DB 2; Length 1829; 

Best Local Similarity 21.1%; Pred. No. 0.00011; 

Matches 233; Conservative 157; Mismatches 469; Indels 245; Gaps 38; 

Qy 2 EDIDQSSLVSSSTDSPPR PPPAFKYQFVTEPEDEE DEEEE E 42 

I : : : I I I : I I | IN I : | | I 

Db 212 EETTWAWESSGEEPASSSTSIPTELSKNDQVTEASGEETITAAATEASEETTTSAVTE 271 

Qy 43 DEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRG PLPAAPP 99

II : :|: :||: |: ::| : I I I II 

Db 272 GSGEDTTWAWELSGEQPAS — SSTSIP TELSKDDQVTEASGEETTTAAATE 322 

Qy 100 AAPERQPS WERSPAAPAPSLPPAAAVLPSKLPEDDE-PPARPPPP 143 

I : I I II III : : | :: | : | | : I 

Db 323 ASEETTTSAVTEGSGEETTWAWESSGEEPASS STSIPTELSKDDQVTEASGEET 378 

Qy 144 P P AGAS P LAE P AAP P S T P AAP KRRG S G S VD ET L FAX PAASEPVIPSSAEKIMD 196 

I I : : I I : II I I I : I : I I : I II: I 

Db 379 TTAAATEASE ETTTSAVTEGSGE-DTTWAWESSGEQPASSSTSIPTELSKDDQ 432 

Qy 197 LMEQPGN TVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEH 237 

: I I III:: : : I : : I : I I : 
Db 433 VTEASGEETTTAAATEASEETTTSAVTEGSGEDTTWAWESSGEQPASSSTSIPT- 488 

Qy 238 GYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPK 297 

I I : : I 111 I : I I I I I : : : : I I : 

Db 489 -ELSKDDQVTEASG — EETTTAAATEASEETTTSAVTEGSGEETTV-VAWESSGEEPAS 544 

Qy 298 GESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPV GKEDRWS PEKTMDI 350 

: : I I : I : : : : : I : I I : I : I : I I I : : : I 

Db 545 SSTSIPTELSKDDKVTEASGEETTTAAATDASSEETTTSAVTEGSGEETTWAWESSD- 603 

Qy 351 FNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410 

I I : I I : I : I I II I III: 

Db 604 -EEPASSSTSIPTELSKDDQVTEASGEETTT AAATEASEETTTSAVTEGSGEE 655 

Qy 411 KSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT 470 

: : I I : : II I : I : I I I hill : I : : 

Db 656 TTWAWESSGEEPASSSTSIPTELSKDDKVTEA SGEETTTAAATDASSEETTTSAV 712 

Qy 471 DEKKIEERKAQIITEKTSPK-TSNPFLVAVQDSELADYVTTDTLSKVTEAAVSNMPEGLTP 529 

I I I : | : : | : : : I : I I I : : I I I : II 

Db 713 TEGSGEETTWAWESSDEEPASSSTSIPTELSKDDQVTEASGEETTTAAATEASEETTT 772 

Qy 530 DLVQEACESELN EATG TKIAYETKVDLVQTSEAIQESLYPTAQLCPSF 577 

II I I : : I III 11:111 I : 

Db 773 SAVTEGSGEETTWAWESSGEEPASSSTSIPTELSKD-DQVTEASGEE TTTAAATE 828

Qy 578 EEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPY 637

I I I I : : : : : I : I I I : I I : I 
Db 829 ASEETTTSAVTEGSGEDTTWAWESSGE QPASSSTSIPTELS 871 

Qy 638 EEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSE 697 

I : : I : : I I : I : : I I I : 



Db 872 KDDQVTEASGEETTT AAATEASEETTTSAVTEGSGEETTV 911



Qy 698 IAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEV PQTQEEAVMLMKESLTEVSET 754 

: I II II I I I I : I I I : I ! I I I : I 

Db 912 VAWESSGEEPA SSSTSI PTELSKDDQVTEASGEETTTAAATEASSEETTTSAVT 966

Qy 755 VAQHKEERLSASPQELGKPYLESFQP NLHSTKDAASN 791 

:| II : I: I I : I" — ! 

Db 967 EGSGEETTTSAVTEGSGEETTTSAVPEGENSTTEAPAFVTGSEIEIPSSEESSSTTTHDP 1026 

Qy 792 DIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESET-FSDSSPIEIIDEFPTFVS 850 

I I : I I : I : I : I : : : : : I I : I I : I I III: 
Db 1027 SIPVITPKPSVSSTIENVMSKTSSEE AAEKKI I GEHQTGKDDDAGKEDEDNMPAFVT 1083 

Qy 851 A KDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 891 

I : I : : I : I I I : I : : : I 
Db 1084 ANPAGTSTTESAENWSTGEEDENIKMAKELGKQFAADLAKLA 1126 

Qy 892 LSFKNIYPKDEVHVSDEFSENRSSVS KASISPSNVSALEPQTEMGSIVKSKSLTKEAEKK 951 

I | I ::: I :: : I | : I I : I : : I I : :: I I :: 

Db 1127 AKDGVNLT-ETADAKDSGETAHVEDEQVSSTE — SSIGSEETTTTVNKETTEE 1176 

Qy 952 LPSDTEKEDRSLSAVLSAELSKTS 975

■ : : I I : : I I I : 

Db 1177 HHEASGEEDDAPAFVTGAPTDSTT 1200 



RESULT 11 
A47282 

calcium-binding protein calphotin - fruit fly (Drosophila melanogaster)
C;Species: Drosophila melanogaster
C;Date: 21-Sep-1993 #sequence_revision 18-Nov-1994 #text_change 21-Jul-2000
C;Accession: A47282
R;Martin, J.H.; Benzer, S.; Rudnicka, M.; Miller, C.A.
Proc. Natl. Acad. Sci. U.S.A. 90, 1531-1535, 1993
A;Title: Calphotin: a Drosophila photoreceptor cell calcium-binding protein.
A;Reference number: A47282; MUID:93165729; PMID:8094559
A;Accession: A47282
A;Status: preliminary
A;Molecule type: nucleic acid
A;Residues: 1-865 <MAR>
A;Cross-references: GB:L02111; NID:g157031; PIDN:AAA28405.1; PID:g157032
A;Experimental source: photoreceptor cells
A;Note: sequence extracted from NCBI backbone (NCBIN:124955, NCBIP:124956)
C;Genetics:
A;Gene: FlyBase:Cpn
A;Cross-references: FlyBase:FBgn0010218
C;Superfamily: collagen alpha 1(I) chain; fibrillar collagen carboxyl-terminal
homology; von Willebrand factor type C repeat homology
C;Keywords: calcium binding

Query Match 5.1%; Score 295.5; DB 2; Length 865; 

Best Local Similarity 21.3%; Pred. No. 7.2e-05; 

Matches 217; Conservative 127; Mismatches 379; Indels 295; Gaps 41; 

Qy 62 AAGL S AAAVP P AAAAP L L D F S S D S VP PAP RG P L P AAP P AAP E R QPSWERSPAAPAP 117 

: I : : I Mill: I : I I I I I : III I I : I I I I I 



Db 11 SAPVAAPWPSAVAAPVQWSPAAVAPAPT^PIAVTPVAPPPTLASVQPATVTIP-APAP 69



Qy 118 SLPPAAAVLPSKLPEDDEPPARPPPPPPAGA SPLAEPAAPPSTP 161 

I : I I : I I : III III : I : I I I II 

Db 70 IAAASVTPVASVAPPWAAPTPPAASPVSTPVAVAQIPVAVSAPVAPPVAATPTPWQIP 129 

Qy 162 -AAPKRRGSGSVDETLFALP — AASEP VI PSSAEKIMDLMEQPGNT VSSGQED 211 

III : I I II I I I I : I I I I I :: 

Db 130 VAAP VI AT P PVAAS APT PAAVT PVI S PVIAS P PWPANTTVPVAAPVAA 178 

Qy 212 FPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATN 270 

I: : II |:::| I :: : : || I : |: II 

Db 17 9 VPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEIPVA — TIPECVAPLIPEVSWATK 236 

Qy 271 PFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVIVRSKDKEDLVCSAALHSP 330 

I I I : I I I : I II 

Db 237 PLA AAE P VWAP PAT ET P WAPAAAS P 263 

Qy 331 QESPVGKEDRWSPEKTMDIFNEMQMSWAPVTCEEYADFKPFEQAWEVKDTYEGSRDVLA 390 

I I : I : : : I I I I I 
Db 264 HVS VAP AVETAWAPV — 27 9 

Qy 391 ARANVESKVDRKCLEDSL'EQKSL — - - GKD'S E GRN E DAS FP S T P E P VKD S S RAY I T CAS F 446 

I: I I I : I :| : I : I I I I :: : : 

Db 28 0 - S AS T E P P VAAAT LT TAP ET P ALAP WAE S Q VAANT WAT PPT PAP E P ET I AP P WAET P 338 

Qy 447 TSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADY 506 

I : : ||:: : I : I I I : I I I : 
Db 339 EVASVAVAETTPPWPPVAAES 1 PAP WAT T P VP AT LAVT D P D 381 

Qy 507 WTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES 566 

I I : I I : I : I I I I I I I I 

Db 382 VT AS AVP E L P P VI AP S P VP S A VAETPVDLA PPV 414 

Qy 567 LYP-TAQLCPSF— EEAEATPSPVLPDIVMEAPLNS LLP SAGAS WQPSVSPLEAP 619 

II I : I : II I I : I : : I I : : I : I I : I I I I 

Db 415 L P P VAAE P VP AWAEET P ET P APAS APVT I - AALD — I P EVAP VI AAP S DAPAEAP S AAA 471 

Qy 620 PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD-- 677

I I I II | : | | | I I :: : 

Db 472 PIVS- TPPT TASVPETTAPPAAVPTEPI 498 

Qy 678 LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE — LVEDSSPESEPVDLFSD-DSIP 731 

: : I : I : I : I : I : : M I : : I I : I I hi 

Db 499 DVS VL S EAAI ET PVAP P VEVTT EVAVADVAP P EAAADL HE PVE P PAP I PDLLEQTT S VP 558 

Qy 732 EVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASN 791 

I : : : : I I : I II : : : I - : I I : I : I : : 

Db 559 AVEAAESTSSPIPETSLPPPNEAVA SPEVAVAPITAPEPIPEP-EPSLATPTEPIPV 614 

Qy 792 DIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSA 851 

: I : : I : : I : : I I I I I I : 

Db 615 EAPV VI Q EAVDAVEVP VT ET S T S I P ET T VE F P EAVAE 651 

Qy 852 KDDSPKLAKEYTDLEV-SDKSEIANIQSGAD- SLPCLEL PCDLSFKNIYP- 899 

I I : I : I : : : : I I I I I : I : I : I : : I I 

Db 652 KVLDPAI T EAP VTT Q E P D VAN I N DGAP AT E I T T PAVE I VT AAAE VS D I AI PVI DPP 707 



Qy 90G - KDEVHVS DE FS ENRS S VS KAS I S PSNVS ALE PQT EMG 936 

I : I : I I : : : : I : I I : 

Db 708 VPQE I AVA- E I P ET DT KP AEVI VEQS T I P I EAP VP EVS K YAE P VI S EAP AAE VP I TAGDN 766 

Qy 937 SIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGV 990 

I : I : : III: I I : I : : I I I I : : I I 

Db 767 PDNTSVGISEVVPTIAEKPVEEVPTSEIPEQSSSPSDSVPVAKITPLL — RDLQTTDV 822 



RESULT 12 
T34513 

hypothetical protein ZK783.1 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 29-Oct-1999
C;Accession: T34513
R;Favello, A.; Vaudin, M.
submitted to the EMBL Data Library, August 1994
A;Description: The sequence of C. elegans cosmid ZK783.
A;Reference number: Z21536
A;Accession: T34513
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-3507 <FAV>
A;Cross-references: EMBL:U13646; PIDN:AAC24418.1; GSPDB:GN00021; CESP:ZK783.1
A;Experimental source: strain Bristol N2; clone ZK783
C;Genetics:
A;Gene: CESP:ZK783.1
A;Map position: 3
A;Introns: 14/1; 48/2; 84/1; 196/3; 303/1; 381/1; 586/1; 605/1; 1175/3; 1207/1;
1409/2; 1450/1; 1760/1; 1898/1; 2153/1; 2628/1; 2681/1; 2899/1; 2946/1; 3088/3;
3184/1; 3283/1; 3346/3; 3365/3; 3484/3; 3504/1

Query Match 5.0%; Score 292; DB 2; Length 3507; 

Best Local Similarity 20.6%; Pred. No. 0.00072; 

Matches 226; Conservative 146; Mismatches 491; Indels 236; Gaps 37; 

Qy 11 SSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDL — EELEVLERKPAAGLSAA 68 

I I I : : : I II : : I I : : I I I I : I I I : I 

Db 2043 SSSSEAPLTSSPATTTEVITESSVKSTTPKEESSSEITVKLSSKSPEVTESSVKSSPSTP 2102 

Qy 69 AVP P AAAAP L LD F S S D S VP PAP RG P L P AAP P AAPERQPSWERSPA APAPSL 119 

: : : : I I : I : : I : I : I I I : I I 

Db 2103 STTSQSVTSTVPETSKSTVLSSEAPVTSTSPTEVHTSSETKPSLSASSTTGDTNSTTPST 2162 

Qy 120 PPAAAVLPSKLPEDDEPPARPPPPPPAGASP-LAEP AAPPSTPAAPKRRGSGSV 172 

I : I : II I I : | | ::: | I III: II 

Db 2163 SSLASVKSTSAPEGTS — AS VAP VKL S S L S P DVS Q P S T KT FDAT ESS T VQAS ET S S GT S V 2220 

Qy 173 DET LFALPAASEPVI-PSSAEKIMDLMEQPGNTVSSGQEDFP SV 215 

I : I : : I II I I : : I I I : I I I I I I 

Db 2221 KSTSEPESHVTKLSITSSNPSSSVPVTSPKSTPTVPESTEQPTSTTPSGQSLTPMNSNSE 2280 

Qy 216 LLETA— -ASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERAT— N 270 

: I I : I I I I : I I I I : : : I : I I I I : 

Db 2281 VLTTSEPHVLSSSLSPDVSQSSTTPNNLSESSTVETPKTSSEVSLNSEEPSTTEAPTTLS 2340 



Qy 271 PFV NRDLAEFSEL EYSEMGSSFKGSPKG ESAI LVENTKEEVIVRSKD 317 

I : : I : : I : : I I : I I I I : : : I I : 

Db 2341 PDILSTTTNNLSQSSTVSTEDRSEISSENSEKPTSAPELVTSSVTHVASSSPDVPTESSE 2400 

Qy 318 KEDLVC SAALH S PQ E S P VGK E D RWS P E KTMD I FN EMQM S WAP VRE E YAD FK 370 

. : I I I : : I : I : I I I : I : I I : 

Db 2401 PDDLTGSSTENIPEASSKQTISSTPTPDTTTASEEPTKSTSMSPDLSTTSNVLSESSTTP 2460 

Qy 371 PFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTP 430 

'1:11 : : III : II | | : : I I 

Db 2461 ESSSKSPVSSSTEG ISWTSTEFSKVPESTISSVLE EDLT-KTTP 2504 

Qy 431 EP VKD S S RAYI T CAS FT SAT ESTTANTFPLLEDHTSENKTDEKKIEE 477 

I : : : I I I I I III:: I : : : : I : 

Db 2505 SPILEET TTASETSEPLTEDSLTVSVRIHELTTSSENVPKESESTTTSSESSKPSQ 2560 

Qy 478 RKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGL TPDLV 532 

I hi II: I: : II :|::| I :: I: I :l I 

Db 2561 EPAGILTSTVWPTSSVSLITASEIEA — ITSNTPFKQGRTPITTSPKSLVKSTTSPSTV 2618 

Qy 533 — QEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPD 590 

III | :: | : |||:: : |: |: |:: :| I 

Db 2619 TSSEPSESTKRTTVSTTVSTTTPTEETTTSESLILTAAPSK — -PTESTTESSEAPTTP- 2674 

Qy 591 IVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGT 650 

I I : I I ||::: 
Db 2675 AKTSETKPS NVSSTSRKS 2692 

Qy 651 KEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE 710 

I :: I : ::: : I I I I : : : II I 

Db 2693 TENVETSTSQSGSLESS TMSSTSSEPETNAPAVTVSSEASSTTLEE 2738 

Qy 711 LVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKE-ERLSASPQE 769 

I I I I : II III: : | : : : ' I : I : I : I I 

Db 2739 NSSTSSPTSSEASVKLSSLFPE — SITSEAVTVSSRAPAEITMSSESHREISTVSSEPSE 2796 

Qy 770 LGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSS 822 

I : I I : : I : I I : I I : I I I : I 

Db 2797 PEIPLSTTVSPNWTASSIPSEE-PILSSVTSSSTPRVRLITG— TPDDLIVSVTVPSHG 2853 

Qy 823 -KEDKIKESETFSDS-SPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGA 880 

: II hi -III : I I : III II 
Db 2854 NRRQNITASSVPSNSTSPIILPSESLTTPQPPPTTTTTAKPAT TSGK 2900 

Qy 881 DSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVK 940 

I ::. I :: Mill I 
Db 2901 RGPPSIQPPAEM FTTPAPPPPSNGGYGEE 2929 

Qy 941 SKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLL 1000 

I : I : : : I I I I I : : : I : I I I I : 

Db 2930 TNQEEEQVTSTTTTEAPSLCSTVTCHSLATCE QSTGVCICRDGFIGD 2976 

Qy 1001 SLTVFSIVSVTAYIALALL 1019 

III I : I I 

Db 2977 GTTACSKKSTADCISLPSL 2995 



RESULT 13 
A56577 

microtubule-associated protein MAP1B - rat (fragment)
C;Species: Rattus norvegicus (Norway rat)
C;Date: 21-Jul-1995 #sequence_revision 21-Jul-1995 #text_change 16-Feb-1997
C;Accession: A56577
R;Zauner, W.; Kratz, J.; Staunton, J.; Feick, P.; Wiche, G.
Eur. J. Cell Biol. 57, 66-74, 1992
A;Title: Identification of two distinct microtubule binding domains on
recombinant rat MAP1B.
A;Reference number: A56577; MUID:92347374; PMID:1639092
A;Accession: A56577
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-2364 <ZAU>
A;Cross-references: GB:X60550
A;Experimental source: brain
A;Note: nucleotide sequence not given; conceptual translation not complete
C;Superfamily: microtubule-associated protein MAP1B

Query Match 5.0%; Score 291.5; DB 2; Length 2364; 

Best Local Similarity 20.0%; Pred. No. 0.00043; 

Matches 220; Conservative 167; Mismatches 456; Indels 255; Gaps 44; 

Qy 30 TEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPA 89 

: I I I I : I : : I I | : | : : : | I II III:: I : 
Db 913 SEEEGEEEEDKAEDAREEDHEPDKTE AEDYVMAWDKAAEAGVTEDQYDFL 963 

Qy 90 PRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGAS 149

I : I I : I I I I : : : II II II 
Db 964 GTPAKQ PGVQSPSREPASSIHDETLPGGSESEAT AS 999 

Qy 150 P LAE PAAP P ST PAAP KRRGS GS VDET L FAL P AAS E P VI P S S AEK IMDLMEQ P GN TVS 206 

I I II I : : I I I I : I III 

Db 1000 DEENREDQPEEFTAT SGYTQST IEISSEPTPMDEMSTPRDVMTDETNNEETES 1052 

Qy 207 SGQE DFPSVLLETAASLP SLSPLS TVS FKEHGYLGNLSAVS S SEGTI E 254

II : I I II I : I I I : I : I : I I 

Db 1053 PSQEFVNITKYESSLYSQEYSKPWASFNGLSDGSKTDATDGRDYNASASTISPPSSMEE 1112 

Qy 255 ETLNEAS KELPERATNPFVNRDLAEF — SELEYSEMG 289 

: : : : : I : : : : | : | : | : | : : | 

Db 1113 DKFSKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERS 1172 

Qy 290 S S FKGSP KGE S AI LVENT KEEVI VR SKDKEDL VCSAALHSP 330 

: I I : I I : : I : : I I : : : I I : I I : I 

Db 1173 VNFSLTPNEI KASAEGEATAWS PGVTQAWEEHCAS PEEKTLEWS PSQSVTGSAGHTP 1232 

Qy 331 -QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFK PFEQAWEVKD 380 

: I I : : : I I : : I I I : : I I : : II 

Db 1233 YYQSPTDEKSSHLPTEVT ENAQAVPVS FEFTEAKDENERS SIS PMDE — PVPD 1283 

Qy 381 TYEGSRDVLAARANVESKVDRKCLED — SLEQKSLGKDS EGRNEDASFPSTPEPVK 434 

: II: : I I I : I : I I : I I I : I I II 

Db 1284 SESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSDKESPVS 1343 



Qy 435 DSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT DEKKI 475



Db 1344 D LTSDLYQDKQEEKRAGFIPIKEDFSPEKKASDAEIMSSQSALALDERKLGGDGS 1398 

Qy 476 EERKAQ 1 1 T EKT S P KT SN P FLVAVQD S EAD YVTT DTL S KVT EAAV — 520 

1:1 I I i :: I : I : I : :: | : I : I 
Db 1399 PTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT YSHMEGVASVSTASVAT 1455 

Qy 521 SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES-LYPTAQLCP- — S 576 

I : I I III : : I :. - I : : I : I M I I : : I : : I I I 

Db 1456 SSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMS 1514 

Qy 577 FEEAEATP SPVLPDIVMEAPLNSLL PSAGASWQ 610 

: : I I : : I : I : I I I : I I :: 

Db 1515 ISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHPTVGAGMLH 1574 

Qy 611 PSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFN 661 

: : III I I : I | I : : : : : : : | : : 

Db 1575 ITEN GPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLS-ELISVSQVEASPSTSS 1630 

Qy 662 AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 721 

I I I I : : I | | | | : : | : | : | | | : | : 

Db 1631 AHTPS QIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKL SPKSDI 1680

Qy 722 VDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQP 780 

I : I I : : I I I I : I : : : I I I : : I 

Db 1681 SPLTPRESSPTYSPGFSDSTSGAKES-TAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQH 1739 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

: I : : I : : : : I : : I I I : I : : : : I I I : 

Db 174 0 HLALSRDLTTSSV EKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYE — SHEKTIQ 1793 

Qy 841 IIDEFPTFVSAKDDSPK LAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKN 896 

I : : : I I : I : I : : I I I ::: I 

Db 1794 AHDVGGYYYEKTERTIKSPCDSGYSYETIEKTTKTP EDGGYS CEITEKT 1842 

Qy 897 IYPKDEVHVSDEFSENRSSVSKAS — ISPSNVSALEPQTEMGSIVKSKSL 944 

: I I I I I : : I : : : : I : I 

Db 1843 TRTPEEGGYSYEISEKTTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSY 1902 

Qy 945 TKEAEKKLPSDTEKEDRS 962 

: I * : I : I II I 
Db 1903 SYETTEKITSFPESESYS 1920 



RESULT 14 
T19431 

hypothetical protein C25A1.10 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 21-Jan-2000
C;Accession: T19431
R;Mortimore, B.
submitted to the EMBL Data Library, October 1996
A;Reference number: Z19124
A;Accession: T19431
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-971 <WIL>
A;Cross-references: EMBL:Z81038; PIDN:CAB02755.1; GSPDB:GN00019; CESP:C25A1.10
A;Experimental source: clone C25A1
C;Genetics:
A;Gene: CESP:C25A1.10
A;Map position: 1
A;Introns: 38/3; 92/3; 201/3; 919/3
C;Superfamily: nucleolus-cytoplasm shuttle phosphoprotein

Query Match 4.9%; Score 288.5; DB 2; Length 971; 

Best Local Similarity 20.6%; Pred. No. 0.00016; 

Matches 198; Conservative 127; Mismatches 391; Indels 247; Gaps 35; 

Qy 7 SSLVSSSTDSPPRPPPAFKYQFVTEP — -EDEEDEEEEEDEEEDDEDLEELEVLERKPAA 63 

II Mill: II: : I : : : : : I M I : : : : II 
Db 147 SSSDSDSDDEPPKKAPAWTKVAPKPMAKKQDTSDSDSDSEDSDDGKSKKANPVKVTPVA 206 

Qy 64 GLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAP PAAP ERQPSWE 109 

: I II: III I I : M II I I :: I : 

Db 207 NVLQKVVAKKAASSSSDSSDDEKKPAAK-PTPAKPTPKPVVKKAESSSDSSDDEKKPVAK 265 

Qy 110 RSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGS 169 

: I I I II I Mill: I I I : I : I . I : : : : 

Db 266 PAPAKATPK-PAAKKADSSSDSSDDEAPAK-KTPAKAAPKPVAKKAESSSDSSDDEKK-- 321

Qy 170 GSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPL 229 

III I M : : : M : I : : I : : I : 

Db 322 PAAK PTPAKATPKPVAKKAESSSDSSDDEKKPVAKPAPAKATPKPV 367 

Qy 230 STVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSE — 287 

: II I I : : : I I : I I : I I I : I :. I 

Db 368 AK KAESSSDSSDDEK-KPAAKPTPAKATPKPVAKKAESSSDSSDDEKK 414 

Qy 288 MGS S FKGSPKGESAI LVENTKEEVI VRSKDKEDLVCSAALHS PQES PVGKEDRWS P 344 

- ■ : I M I I : I I I : : I I : I I : I 

Db 415 PVAKPTSAKATPK PAAKKAD S S S DS S DDEAPAKKT PAKAAP 455 

Qy 345 E KTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRD VLAARANV 395 

: I : : I I I : I : I I II 

Db 456 KPASKKAESSSDSSDDEKPAAKSTPAKITPKPTAKKVASSSSDSSDDEKKPAAKPTPANA 515 

Qy 396 ESKVDRKCLEDSL EQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT SAT 450 

I Til |:|:|:: : | | : | : : : 

Db 516 TPKPVAKKAESSSDSSDDEKKPVAKPTSAK ATPKPAAKKADLSSDFSDDEAPA 568 

Qy 451 ESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPK-TSNPFLVAVQDSEADYVTT 509 

Ml I : I : : I I : I : Mill: : I I I 

Db 569 KKTPAKAAPKPASKKAESSSDSSDDEKPAAKSTPAKTTPKPTAKKAASSSSDSSDD — EK 626 

Qy 510 DTLSKVTEAAVSNMPEGLTPDLVQEACESELN-- EAT GT KI AYET KVDLVQT S EAI 563 

: : I I I : I I : : M M I I M I I : I : : 

Db 627 KPVAKPTSAKATPKPAAKKADSSSDSSDDEAPAKKTPVKPTPVKIVAK-KVD — SSSDSS 683 

Qy 564 QESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVS 623 

: I I : I I I I I T: III I 
Db .68 4 DDEKKPT KATPVKVTP KSVTKKAAAS : S 710 

Qy 624 YDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETK 683 



I I : I I : : I I i : I I : : II 
Db 711 SDS — SDDEKKPWKQTPNW PKKEKAASSSDDSS DDEKK 748 

Qy 684 LSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVML 743 

: : I : I I : I : : I I I : I I I I I 

Db 749 PTAKPTP KATPKQSAKKADSSDDS SDDEAPA 779 

Qy 744 MKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKIS 803 

II I I I I I 

Db 78 0 KKT PAKS T PAKT AVKKEAS S 7 99 

Qy 804 LQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS — PKLAKE 861 

I : I I : : | | - : | : | : | : : : I : I I I I : I 

Db 800 SSDDSSDDEKTKKKSATTPAKSTPKTALKKAESSDSSDDDEDLPKPSKA 848 

Qy 862 YTDL-EVSDKSEIANIQSGADSLPCLEL-PCDLSFKNIYPKDEVHVSDEFSENRSSVSKA 919

I : : | | | : : I I : I I : : : I I I : : I 
Db 849 VTPRPQRADSEESAETEESSSRTPALKAKPLATSTEKAVYENRKRKSSPF — RRVQMTKD 906 

Qy 920 SIS 922 

I : I 

Db 907 SVS 909 



RESULT 15 
QRMSP1 

microtubule-associated protein MAP1B - mouse
N;Alternate names: microtubule-associated protein MAP1(X); microtubule-associated
protein MAP1.2; microtubule-associated protein MAP5
C;Species: Mus musculus (house mouse)
C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 01-Sep-2000
C;Accession: S07549; S44387; A33645
R;Noble, M.; Lewis, S.A.; Cowan, N.J.
J. Cell Biol. 109, 3367-3376, 1989
A;Title: The microtubule binding domain of microtubule-associated protein MAP1B
contains a repeated sequence motif unrelated to that of MAP2 and tau.
A;Reference number: A33645; MUID:90094539; PMID:2480963
A;Accession: S07549
A;Molecule type: mRNA
A;Residues: 1-2464 <NOB>
A;Cross-references: EMBL:X51396; NID:g52999; PIDN:CAA35761.1; PID:g53000
R;Sanchez, C.; Padilla, R.; Paciucci, R.; Zabala, J.C.; Avila, J.
Arch. Biochem. Biophys. 310, 428-432, 1994
A;Title: Binding of heat-shock protein 70 (hsp70) to tubulin.
A;Reference number: S44387; MUID:94234720; PMID:8179328
A;Accession: S44387
A;Status: preliminary
A;Molecule type: protein
A;Residues: 653-663, 'IC <SAN>
C;Superfamily: microtubule-associated protein MAP1B
C;Keywords: microtubule binding; phosphoprotein; tandem repeat
F;589-786/Domain: microtubule binding #status experimental <MTB>
F;589-592,639-642,649-652,655-658,660-663,668-671,674-677,679-682,683-686,687-
690,691-694,695-698,699-702,708-711,712-715,716-719,720-723,727-730,758-761,764-
767,783-786/Region: 4-residue repeats (K/R-K-E/D-X)
F;1861-2064/Region: 17-residue repeats
F;91,116,351,888,1124,1153,1168,1208,1662,1877,1918,2003,2030,2054,2083/Binding
site: phosphate (Ser) (covalent) #status predicted
F;147,969,1336,1562,1563,1702,1708,1990,2057,2063,2419/Binding site: phosphate
(Thr) (covalent) #status predicted
F;1953/Binding site: phosphate (Tyr) (covalent) #status predicted

Query Match 4.9%; Score 284; DB 1; Length 2464; 

Best Local Similarity 20.7%; Pred. No. 0.00093; 

Matches 233; Conservative 159; Mismatches 424; Indels 312; Gaps 51;

Qy 31 EPEDEEDEEEEEDEEED DEDLEELE VLERKPAAG LSAAAVP 71 

I I I : I I I I I : I I : I I I I : : II I : I 

Db 1009 EAEQS EEEGEEEDKAEDAREEG YEPDKT EAEDYVMAV7U)KAAEAGVTEEQ YGYLGT SAKQ 1068 

Qy 72 PAAAAPLLDFSS DSVPPAPRGPLPAAPPAAPERQP SWERSPAAPA 116 

I : I : : I : : : I I : III I : I 

Db 1069 PGIQSPSREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEISSEPT 1128 

Qy 117 P SLPPAAAVLPSKLPEDDEPPARP P P P P PAGAS P LAE PA — 155 

I I : I: I I:: I I : | : | : 

Db 1129 PMDEMSTPRDVMSDETNNEETESPSQEFVN.ITKYESSLYSQEYSKPAVAS FNGLSEGSKT 1188 

Qy 156 AP P S T PAAP KRRG S GS VDET L FAL PA ASEPVI PSSAEKIMDLMEQ 200 

I I I : I I : : I I : I : I : : I I I : I : : 

Db 1189 DAT DGKD YNAS AS T I S P P SSMEEDKFSKSALRDAYCSEEKELKASAE — LDIKDV 1241 

Qy 201 PGNTVS S GQEDFP S VLLETAAS LP S LS PLSTVS FKEHGYLGNLS AVS S S EGT I EETLNEA 260 

: I I I I I I I I I I : I 
Db 1242 SDERLS PAKSPSLSP SPPSPIEKT 1265 

Qy 261 SKELPERATNPFVNRDLAEFS ELEYSEMGSSFKGSPKGESAILVEN- — TKEEVIVR 314 

111:1 .11 I : : I I : II I : : I : : I I : 

Db 1266 — PLGERSVN FSLTPNEI KVSAEGEARSVS PGVTQAWEEHCAS PEEKTLE 1314 

Qy 315 SKDKEDLVCSAALHS P-QES PVGKEDRWS PEKTMDI FNEMQMS WA- PVREEYADFK — 370 

I : I I : I : I I : I I : : I : : III I : : : I 

Db 1315 WSPSQSVTGSAGHTPYYQSP T DEKS S HL PT EVS ENAQAVP VS FE FS EAKDE 1366 

Qy 371 P FEQAWEVKDT YEGS RDVLAARANVES KVDRKCLED — SLEQKSLGKDS 417 

I :: II: I I : : I I I : I I I : I 

Db 1367 NERASLSPMDE— PVPDSESPVEKVXSPLRSPPLLGSESPYEDFLSADSKVLGRRSESPF 1424 

Qy 418 EGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT 470 

I I : I II III: : : I : I : I I III 

Db 1425 EGKNGKQGFPDRESPVSDLT STGLYQDKQEEKSTGFIPIKEDFGPEKKTSDVETMS 1480

Qy 471 DEKKI EERKAQIITEKTSPKTSNPFLVAVQDSEAD 505 

I I : I : I : I I I I : : I I : I 

Db 1481 SQSALALDERKLGGDVSPTQIDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGV--AEDT 1538 

Qy 506 YVTTDTLSKVTEAAV — SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAI 563 

I : : : | : | : | I : I I III : : | : | : : I : I II 

Db 1539 YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 1597 

Qy 564 QES-LYPTAQLCP SFEEAEATP SPVLPDIVMEAPLNSLL — 601 

I I : : I : : I I I : : I I : : I : I : I 

Db 1598 QETEMSPSKEECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSFAMD 1657 



Qy 602 PSAGASVVQPSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVA 644

I: MM: : : | | | | Ml | I : ::: 

Db 1658 FS RQS P DH PTLGAS VLH I TEN GPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLS 1714

Qy 645 LKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKS 704 

: : : I : : I I I I : : I I : I M : I 
Db 1715 -ELISVSQVEASPSTSSAHTPS QIASPLQEDTLSDWPPREMSLYASLA 1762 

Qy 705 VPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVML -MKE S LT EVS ETVAQHKE 760 

: I I : III: III II M I = I : I I I I 

Db 1763 SEKVQ — SLEGEKLSPKSDIS PLTPRESSPLYSPGFSDSTSAAKETAAAH — 1810 

Qy 761 ERLSASP QELGKPY LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFN 810 

: I : I I : I I : : I : I : I : : : : I : : I I 

Db 1811 -QAS S S P P I DAATAEP YGFRS SMLFDTMQHHLALNRDLTT S 3V EKDSGGKTPGDFN 18 65 

Qy 811 TAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPK LAKEYTDLE 866 

I :: I ::: : I I I I : : : I I : I 

Db 1866 YAYQKPENAAGSPDEEDYDYE — SQEKTIRTHDWRYYYEKTERTIKSPCDSGYSYETIE 1923 

Qy 867 VSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKAS 920 

: I : : I I : : : I : I I I I I : : I 

Db 1924 KTTKTP EDGG YTCEITEKTTRTPEEGGYSYEISEKTTRTPEVSGYTYEK 1972 

Qy 921 ISP SNVSALEPQTEMGS I VKS KSLTKEAEKKLP S DTEKEDRS 962 

: : : : I : I : I : I : I I I I 

Db 1973 TERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESESYS 2020 
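
The Best Local Similarity figure reported for each hit above appears to equal Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). A minimal Python sketch of that check follows; the function name `best_local_similarity` is hypothetical, and the formula is inferred from the numbers printed in this listing, not taken from GenCore documentation.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all alignment columns.

    Assumption: the column count is the sum of the four reported counts.
    """
    columns = matches + conservative + mismatches + indels
    if columns <= 0:
        raise ValueError("alignment must contain at least one column")
    return round(100.0 * matches / columns, 1)


# Counts copied from the statistics lines of three hits in this listing.
print(best_local_similarity(233, 157, 469, 245))  # reported as 21.1%
print(best_local_similarity(217, 127, 379, 295))  # reported as 21.3%
print(best_local_similarity(233, 159, 424, 312))  # reported as 20.7%
```

The "Gaps" count is not used here: it counts gap openings, while "Indels" counts the gapped columns themselves.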

Search completed: September 3, 2004, 16:09:46 
Job time : 37.8344 secs






GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM protein - protein search, using sw model

Run on:          September 3, 2004, 16:09:02 ; Search time 114.801 Seconds
                 (without alignments)
                 3191.803 Million cell updates/s

Title:           US-09-830-972-2
Perfect score:   5848
Sequence:        1 MEDIDQSSLVSSSTDSPPRP......VKDAMAKIQAKIPGLKRKAD 1163

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        1298764 seqs, 315065143 residues

Total number of hits satisfying chosen parameters:    1298764

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       Published_Applications_AA:*
                 /cgn2_6/ptodata/1/pubpaa/US07_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/PCT_NEW_PUB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US06_NEW_PUB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US06_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US07_NEW_PUB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/PCTUS_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US08_NEW_PUB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US08_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US09A_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US09B_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US09C_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US09_NEW_PUB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US10A_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US10B_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US10C_PUBCOMB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US10_NEW_PUB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US60_NEW_PUB.pep:*
                 /cgn2_6/ptodata/1/pubpaa/US60_PUBCOMB.pep:*
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed 
and is derived by analysis of the total score distribution. 
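
The Query Match percentages in this listing appear to be each hit's score expressed as a percentage of the perfect score reported in the run header (5848 for this query). A minimal Python sketch of that relationship follows; the function name `query_match_percent` is hypothetical and not part of GenCore, and the formula is an assumption inferred from the reported values.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the query's perfect (self-match) score."""
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 5848  # "Perfect score" from the run header of this search

# Scores copied from hits in this listing.
for score in (5848, 4403.5, 1513, 302.5):
    print(score, query_match_percent(score, PERFECT_SCORE))
```

For the scores shown, this gives 100.0, 75.3, 25.9 and 5.2, which agree with the percentages printed for those hits.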
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ID 
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Sequence 18, Appl 
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5307 


90. 
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US-10-267-502-431 


Sequence 431, App 


3 


4403.5 


75. 
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1192 


9 


US-09-758-140-6 


Sequence 6, Appli 
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4403.5 


75. 
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1192 


9 


US-09-972-599A-6 


Sequence 6, Appli 


Result          % Query
   No.    Score   Match  Length  DB  ID                   Description
---------------------------------------------------------------------------
     5   4403.5    75.3    1192  14  US-10-060-036-71     Sequence 71, Appl
     6   4398.5    75.2    1192   9  US-09-789-386-2      Sequence 2, Appli
     7   4398.5    75.2    1192   9  US-09-893-348-23     Sequence 23, Appl
     8   4398.5    75.2    1192  12  US-10-267-502-429    Sequence 429, App
     9   4398.5    75.2    1192  16  US-10-327-213-9      Sequence 9, Appli
    10   4398.5    75.2    1192  16  US-10-466-258-9      Sequence 9, Appli
    11   4389.5    75.1    1192  12  US-10-408-967-7      Sequence 7, Appli
    12     1513    25.9     379  14  US-10-205-194-164    Sequence 164, App
    13   1411.5    24.1     360   9  US-09-893-348-20     Sequence 20, Appl
    14     1191    20.4     373   9  US-09-789-386-6      Sequence 6, Appli
    15     1191    20.4     373   9  US-09-765-205-6      Sequence 6, Appli
    16     1191    20.4     373   9  US-09-893-348-24     Sequence 24, Appl
    17     1191    20.4     373  12  US-10-408-967-8      Sequence 8, Appli
    18     1191    20.4     373  14  US-10-060-036-72     Sequence 72, Appl
    19     1183    20.2     373  16  US-10-466-258-4      Sequence 4, Appli
    20      925    15.8     199   9  US-09-893-348-21     Sequence 21, Appl
    21      908    15.5     199   9  US-09-893-348-25     Sequence 25, Appl
    22      908    15.5     199  12  US-10-660-946-1      Sequence 1, Appli
    23      901    15.4     199  12  US-10-408-967-9      Sequence 9, Appli
    24      899    15.4     199  12  US-09-978-360A-467   Sequence 467, App
    25      888    15.2     289   9  US-09-789-386-4      Sequence 4, Appli
    26      868    14.8     199  16  US-10-466-258-11     Sequence 11, Appl
    27      801    13.7     111  14  US-10-205-219-93     Sequence 93, Appl
    28      792    13.5     780  12  US-10-267-502-432    Sequence 432, App
    29    789.5    13.5     776  12  US-10-660-946-5      Sequence 5, Appli
    30    789.5    13.5     116  12  US-10-267-502-430    Sequence 430, App
    31    716.5    12.3     356  12  US-10-660-946-6      Sequence 6, Appli
    32    706.5    12.1     593  15  US-10-108-260A-2892  Sequence 2892, Ap
    33      685    11.7     208  12  US-10-660-946-7      Sequence 7, Appli
    34      671    11.5     267  12  US-10-660-946-8      Sequence 8, Appli
    35      671    11.5     267  14  US-10-205-194-127    Sequence 127, App
    36      630    10.8     266  12  US-10-276-774-2330   Sequence 2330, Ap
    37    625.5    10.7     236   9  US-09-729-674-20     Sequence 20, Appl
    38    625.5    10.7     236   9  US-09-765-205-26     Sequence 26, Appl
    39    625.5    10.7     236  12  US-10-408-967-2      Sequence 2, Appli
    40    625.5    10.7     269  14  US-10-106-698-6222   Sequence 6222, Ap
    41    539.5     9.2     168  10  US-09-809-391-563    Sequence 563, App
    42    539.5     9.2     168  10  US-09-882-171-563    Sequence 563, App
    43    539.5     9.2     168  12  US-10-164-861-563    Sequence 563, App
    44      520     8.9     222  12  US-10-267-502-428    Sequence 428, App
    45      519     8.9     241  12  US-10-660-946-3      Sequence 3, Appli



ALIGNMENTS 



RESULT 1 

US-09-893-348-18
; Sequence 18, Application US/09893348
; Patent No. US20020072493A1
; GENERAL INFORMATION:
; APPLICANT: EISENBACH-SCHWARTZ, Michal
; APPLICANT: COHEN, Irun R.
; APPLICANT: BESERMAN, Pierre
; APPLICANT: MOSONEGO, Alon
; APPLICANT: MOALEM, Gila
; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND
; THEIR USES
; FILE REFERENCE: EIS-SCHWARTZ=2A
; CURRENT APPLICATION NUMBER: US/09/893,348
; CURRENT FILING DATE: 2001-06-28
; PRIOR APPLICATION NUMBER: US 09/314,161
; PRIOR FILING DATE: 1999-05-19
; PRIOR APPLICATION NUMBER: US 09/218,277
; PRIOR FILING DATE: 1998-12-22
; PRIOR APPLICATION NUMBER: PCT/US98/14715
; PRIOR FILING DATE: 1998-07-21
; PRIOR APPLICATION NUMBER: IL 124500
; PRIOR FILING DATE: 1998-05-19
; NUMBER OF SEQ ID NOS: 29
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 18
; LENGTH: 1163
; TYPE: PRT
; ORGANISM: Rattus norvegicus
US-09-893-348-18

Query Match 100.0%; Score 5848; DB 9; Length 1163; 

Best Local Similarity 100.0%; Pred. No. 5.8e-287; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy        181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy        241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy        301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy        361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Qy        421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy        481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy        541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy        601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy        661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy        721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy        781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy        841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy        901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy        961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy       1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy       1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy       1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
              |||||||||||||||||||||||
Db       1141 NKSVKDAMAKIQAKIPGLKRKAD 1163



RESULT 2 

US-10-267-502-431

; Sequence 431, Application US/10267502
; Publication No. US20040071700A1
; GENERAL INFORMATION:
; APPLICANT: Kim, Jaeseob
; APPLICANT: Galant, Ron
; TITLE OF INVENTION: Obesity Linked Genes
; FILE REFERENCE: LSD-07416
; CURRENT APPLICATION NUMBER: US/10/267,502
; CURRENT FILING DATE: 2003-01-27
; NUMBER OF SEQ ID NOS: 439
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 431
; LENGTH: 1163
; TYPE: PRT
; ORGANISM: Mus musculus
US-10-267-502-431 

Query Match 90.7%; Score 5307; DB 12; Length 1163; 

Best Local Similarity 91.1%; Pred. No. 1.2e-259; 

Matches 1066; Conservative 34; Mismatches 56; Indels 14; Gaps 7; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I : I I I II I I I I I I I I I I II I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

I I I II I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 60 PAAGLSAVPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

I I I I I I : I : I I I I I I I I I I I I : I I I I I I I I I II I I : I I I I I I I I I I I I I I I I I I I I I 
Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296 

Qy 301 AI LVENTKEEVIVRSKDKEDLVCSAALHS PQES P VGKEDRWS PEKTMDI FNEMQ 355 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I III I : I I I I I I I I I I I I : 

Db 297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMK 356

Qy 356 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I I I I II 
Db 357 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKSHGK 416

Qy 416 DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 475

III I I I : I I I I I I I I III I I I I I I I I I I I I I I I I II I I : I I I I M I I I I I I I I I 
Db 417 DSESRNENASFPSTPELVKDGSRAYITCDSFTSATESTAANIFPVLEDHTSENKTDEKKI 476 

Qy 476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535 

I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I I II I I I I I II I : I I I I I I I I I I I I I 
Db 477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYWTDNLSKVTEAWATMPEGLTPDLVQEA 536 



Qy        536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595
              |||||||||||||||||||||||||||||||:||||||||||||||||||||||||||||
Db        537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596



Qy 596 PLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655

I I I I I II I I I I I III I I I I I I I I I I I I I I I I I II I I I I I I : I I I I II II 

Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDAKEEIK 656 

Qy 656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715 

I I I I I I I I II I I I M I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I : I llhll 
Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEKSVPDHCELVDDS 716 

Qy 716 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 774 

. I I I I I I I I I II I I I I I II I I I I II I I I I I I I i I I I I I I I I I II : I I I I I I I I I : I I I I 
Db 717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776 

Qy 775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

I I I I I I I II I I I I I I I : I I I I I I I I I I I I I I I || I I I I I I I M I I I I I I I : I || I I I I 
Db 777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836 

Qy 835 DSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 894 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I : I I I I : I I I I I I I I I I I I 

Db 837 DSSPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 893

Qy 895 KNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 954

II I I I I I I I I II I I : : I I I I I I : I I I I I I I I I I : I I I I I I II II : I I I I 

Db 894 KNTYPKDEAWSDEFSKSRSSVSKVPLLLPWSALESQIEMGNIVKPKVLTKEAEEKLPS 953

Qy 955 DTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVV-FGASLFLLLSLTVFSIVSVTAY 1013

I I I II I I I I : I I I I II I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 954 DTEKEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGVVYFGASLFLLLSLTVFSIVSVTAY 1013

Qy 1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I 
Db 1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073

Qy 1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQI 1133

I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQI 1133

Qy 1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I : 
Db 1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAE 1163



RESULT 3 
US-09-758-140-6 

; Sequence 6, Application US/09758140 
; Patent No. US20020012965A1 
; GENERAL INFORMATION: 

; APPLICANT: Strittmatter, Stephen M.
; TITLE OF INVENTION: Nogo Receptor-Mediated Blockade of
; Axonal Growth
; FILE REFERENCE: 44574-5073-US
; CURRENT APPLICATION NUMBER: US/09/758,140
; CURRENT FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: US 60/175,707
; PRIOR FILING DATE: 2000-01-12
; PRIOR APPLICATION NUMBER: US 60/207,366
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: US 60/236,378
; PRIOR FILING DATE: 2000-09-29
; NUMBER OF SEQ ID NOS: 20
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 6
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-758-140-6 

Query Match 75.3%; Score 4403.5; DB 9; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 5.6e-214; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I II I I I I I I I I I I I I I I : I I :: I I I I I I I II I I I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I II M Ml I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I ' I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

I I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I I I I I I 
Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : ' I I : I I I III 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I II I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

I I I I I : I I Mill: I I I I I : I M II I I I : I I I I I I I I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I II I I I I I I I I I I II I I : I I I I : I I I II I I I I I I I i I I I I I I I I I I I I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I I I I I I I I I I I II II I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I III 
Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I I : I : I I I I I II 



Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655



Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I I I : I : I I : I II Mill: I I I : I II I I I II II I II I I I I I I I I I I : II I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I I : I I I : II : I ": I I I I I M I : I II I I I I I I I I I I : I I I I : I I I I : M II I I I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I I I I I I I I I I I : : I : I I I : : : I I : I I I I I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

M I M I M I I Ml M : I : I I I M I II M M I M II MM II II I M M II M I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I M II I I I II I I I I I I I I I I I I : : : II M I M I : II : I M 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

I II I I : Ml! I I I I I I I I II I I I I I I I I I M : I II I I I I I I I I II I I I I I I 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I II I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I I I I I I I I I I I I I II I I I I I I M 

Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I M I I I I I I I I I I II II I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

II M I I II I I I I II I M II I I I I I I I I M I II I II M I II I I I I I I I I II I I I II : 

Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 4 

US-09-972-599A-6 

; Sequence 6, Application US/09972599A 
; Patent No. US20020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STRITTMATTER, STEPHEN M.
; TITLE OF INVENTION: NOGO-RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH
; FILE REFERENCE: C077 CIP US
; CURRENT APPLICATION NUMBER: US/09/972,599A
; CURRENT FILING DATE: 2001-10-06
; PRIOR APPLICATION NUMBER: PCT/US01/01041
; PRIOR FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: 09/758,140
; PRIOR FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: 60/236,378
; PRIOR FILING DATE: 2000-09-29
; PRIOR APPLICATION NUMBER: 60/207,366
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: 60/175,707
; PRIOR FILING DATE: 2000-01-12
; NUMBER OF SEQ ID NOS: 57
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 6
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-972-599A-6

Query Match 75.3%; Score 4403.5; DB 9; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 5.6e-214; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : : I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I I I -II II II I I I I I I I I II I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

I I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I t I I I : : I I : I I : I I : : I I : I : I I : I I : I I I III 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

llllhll I I I I I : I I I I I : I I I I I I I I : I I I I I I I I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I I I I I I I I : I I I I : I I I II I I I I I I I I I I I I I I I I I II I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : II I I I I I I I I I II I I I I I I I I I I I I I I I II I : I I I I I I I : II I I I I III 
Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

I I II II : I I I I M I I I I I I I I I I I I I : I I I I I I I : I I I II I I I I : I : I I I II I I 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 



Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I M : I : I I : I II Mill: I I I : I I II I I I I I I I I I M I I I I I I I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 



Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I : I II : I I I : I I : I : I I I I I I I I : I I I I I I I I I I II I : I I I hi I I I : I I I I I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I :::::: I : I I I I I I I I I I I I I :: I : I I I : : : I I : I I I I I I I I I I 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 



Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: || : I || I I I III : I : I : I I I I I I I I I I I I I I I I I : I : I II I I I : I II I I I I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 



Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I I I I I II I I I I I I II I I I : : : I I : I I : I I : I I : I : I 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

III I I : MM I I II II I I II I I I II I I M II: I I I I II I I I I I I I M I I I I 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015



Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I I I II I I II I I I I II I I I I I II I I I I I M I I I I I II I II I I I I I I I I M I I I I M I I M I 

Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

II I I I I I I I I I II M I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I I II I I M I I I I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I II I I I II I M I II : I II I I I I I I I I I I I II I I I : I I I I I I I I II II I I I I I I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 5 

US-10-060-036-71 

Sequence 71, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 
APPLICANT: Benson, Darin R. 
APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes, Michael J. 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/10/060,036
CURRENT FILING DATE: 2002-01-30
NUMBER OF SEQ ID NOS: 4560
SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 71
LENGTH: 1192
TYPE: PRT
ORGANISM: Homo sapiens
US-10-060-036-71 

Query Match 75.3%; Score 4403.5; DB 14; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 5.6e-214; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

IM:||| I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I : I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I I I I I I II I I I I I I MM Ill I I I I I I : I : I I I I I I II I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

I I I I I I I I I I I I I I I I I : I I I : : I : : I I I ! I : I : I :: I I I I I I I I LI 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I II I I I I I I I I I : : I I : II : I I : : I I : I : I I : I I : I I I I I I 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

11111:11 I I I I I : I I I I I : I I I I I I I I : I I I I I I I I I I HIM 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513

I I I I I I I I I I I I I I I I I I I : I I I I : I I I II I I I I I I I I I I I I I I I I I I I I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I I I I I II I I I I I I I I I I I I I I 1111111111:1111111 : I I I I I I III 
Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

I I I I I I : I I I I I I I I II II I I I I I II : I I I I I I I : I I I I I I I I I : I : I I I I I I I 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I II I : I : I I : I II Mill: I I I : I I I I I I I M I I I I I I I I I I II I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 



Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752
I : I I I : I I I : I I : I : I I I I I I I I : I i I i M I I I I I I I : I I I I : I I I I : I I I I I I I
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809
1 : : : : : : I : I I I I I MINIM: : | MM : : : I M M I I I I I M I
Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868
: I I : I I M I I Ml M M M I I M I I M M II I M I MM I I I M M I I I M M
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926
M M M M MM Ml Ml Ml M : : : I I M I M M I I : I M
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
III I I: MM I I I I I I I I II I I II I II I I II: I I I I I I II I II II I II II
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
I I I II I I I I I I I I I I II I I I I I II II II I II I II M M I I I II I I M I I I M I M I M II
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
I I I II I I I I II I I I I II I I I I II II I I I II I II I I I I II I II I I I I M I M I I M M II
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
I I I I M I I I I I I I I I M I I I I II I I I I I II I II I I : I I I I M I I II I I I I I II I I :
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 6 
US-09-789-386-2 

Sequence 2, Application US/09789386 
Patent No. US20020010324A1 
GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID
APPLICANT: PRINJHA, RABINDER KUMAR
TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 
CURRENT APPLICATION NUMBER: US/09/789,386 
CURRENT FILING DATE: 2001-02-21
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5
PRIOR FILING DATE^1,9L9.8— 



PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: J L9,air07 -22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3 
SEQ ID NO 2 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-2 




Query Match 75.2%; Score 4398.5; DB 9; Length 1192;

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 1 I I I I I I II I I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III I I I : I I : I II I I I I I I I I I I I I II I I I I I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I I I I I I I I I I I I I II I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

I I I I I I I I I I I I I I I I I : I I I : : I : : I I II I : I : I : : I I I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I III 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

11111:11 Mill: I I I I I : I I I I I I I I : I I I I I I I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I I I I I I I I : I I I I : I I I II I II I I I I I I I I I I I I I I I I I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I III 
Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I I : I : I I I I I I I 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I I I : I : I I : I II Mill: I II : II I II I I I M II I I I II II I I I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : M I : II I : II : I : I II I I II I : II I I I I I I I II I I : I I I Ml M M I I II I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 



Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I I I I I I I I I I I : : I : II I : : : I I : I i I I I I I I I I 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: I I : I II I I I III : I : I : I I I I II I I I I I I I I I I I : I : I II I I I : I I I I I I I I 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I II I I I I I I I M I I I I I I I I : : : I I : I I : I I : II : I : I 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

III I I : I II I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIAIALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I : II I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I II I II I I : 1 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192

RESULT 7 

US-09-893-348-23 

Sequence 23, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EISENBACH-SCHWARTZ, Michal
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/893,348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS: 29
SOFTWARE: PatentIn version 3.1
SEQ ID NO 23 
LENGTH: 1192 
TYPE: PRT 



ORGANISM: Homo sapiens 
US-09-893-348-23 



Query Match 75.2%; Score 4398.5; DB 9; Length 1192;

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I (Mill I I I I I I I I I I I I I I I I I I I : I I : : I I I I I I I I I I I I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

MINIM II III 111:11 : I I I I I I I I I I I I I I I 11111111:11 : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II I I I I I I I I I I I I I I I I I II I II I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 
Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I I I I M 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I II I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I III 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

Ml II I III : : : I I I : I I I I I I I I I I I : Mill: I I Mill : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

II I II : I I Mill: I I I I I : I I I I I I I I : I I I I I I I I I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I II I I I M I II I I I I II I I : I I I I : I I I II II II I I I I I I I II I I I I I I I : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I I II I I II I I I I I I I I I I I I I I I II I I I I II : I I I I II I : I I I II I III 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I II I : I I II I I I I I II I I I II I I I : II I I I I I : I I I I I I II I : I : I I I I I II 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

I I I II II I : I : I I : I II I II I I : I I M I I I I I I II I I I II II I I II I I I I : I M 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I I : I I I : I I : I : I II I I I M : I I I I I I I II I I I I : I I I Ml I M : II I I I I I 



Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775



Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I i I I MINIM: : I MM : : : M M I I I I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

M I M I I II I III M M M I I I I I I I I I I I I I I II MM II M I M I I M I I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

MINI I I I I I I I M I I I I I I I I : : : II M I M I : I I : I M 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

Ml I I : MM I I I I II I I I II I II I I M I Ih I I I I II M M I I I II I II I 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I II I II I M I I I I II I I I I I II I I I I II I I I II I I I II M II I I I I I I I I I M I I I M II 

Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I I I I I I II II II I I I I I I I I I I I I I I I I II I I I I I II I II II II I I I I I II M I I I I 

Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I II I I I II I I I M I I I I I I I I I I I I I I II I I : I I I I I I I I I I I I I I I II II : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192
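
The summary counts reported for this hit can be cross-checked against the reported "Best Local Similarity". The sketch below assumes, as an inference from the numbers rather than a documented definition, that the percentage is the number of exact matches divided by the total number of aligned columns (matches + conservative + mismatches + indels). The function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percentage of aligned columns that are exact matches.

    Assumption: every alignment column is counted exactly once as a match,
    a conservative substitution, a mismatch, or an indel.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts taken from the summary lines of this hit.
print(best_local_similarity(908, 104, 146, 39))  # 75.9
```

Under the same assumption, the counts 907/104/147/39 and 379/0/0/784 reported for later hits give 75.8 and 32.6, which agree with the percentages printed in those records.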



RESULT 8 

US-10-267-502-429 

; Sequence 429, Application US/10267502 

; Publication No. US20040071700A1 

; GENERAL INFORMATION: 

; APPLICANT: Kim, Jaeseob 

; APPLICANT: Galant, Ron 

; TITLE OF INVENTION: Obesity Linked Genes 

; FILE REFERENCE: LSD-07416
; CURRENT APPLICATION NUMBER: US/10/267,502
; CURRENT FILING DATE: 2003-01-27
; NUMBER OF SEQ ID NOS: 439

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 429
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens 
US-10-267-502-429 

Query Match 75.2%; Score 4398.5; DB 12; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I M I I I I I I I MIMI I I I I I I I I I I I M I II I I I M I : : I I I I II I I I I I II 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 



Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I I I I I I I : I I : T I I I I I II I I I I I I I I II I I I I I : I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

Mill I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : II I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I II I I I : : I I : II : I I : : I I : I : I I : | | : | | | Ml 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I i :[ I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

I I I I I : I I Mill: II I I I : I II I I I II : I I I II I II I Mill 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513

I I I I I I I II I I I I II I I M M I I I M II II I I M II I I I II I I MINI: 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573

I I I I I M I I I II I II I I I II I I I I I I I I I I I I I I I I I M I I I I II M II I I I III 

Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

II I I I I M M I II I I I I I I I II I I I I M I I I I I I M II I II I I I M M I I I I I I 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

I I I I I I I I M : II : I II Mill: I II M I II I I I II I I II I II M II I M : II I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I : M I : I I I : I I : I : I I I I I I I I : I I I I I I I I I I I II : I I ! I : I 111 = 111111 I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

I : : : : : : I M II I I II I I I II I : M MM : : : II M I II I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: I I : I M I I I II I : I M : I I I I II I II I I I I I I II : I M I I I II : I I II II I I 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895



Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I I I I I I I I I I I I I I I I i I : : : I I : I I : I I : I I : I : I 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

Ml I I : I I I I I I I I I I I I I I I I I I I I I I I II: I I I II I I I I I I I I I I I I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I I I I I I I I I II I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 9 
US-10-327-213-9 

Sequence 9, Application US/10327213 
Publication No. US20040121341A1 
GENERAL INFORMATION: 
APPLICANT: FILBIN, MARIE T. 
APPLICANT: DOMENICONI, MARCO 
APPLICANT: CAO, ZIXUAN

TITLE OF INVENTION: INHIBITORS OF MYELIN-ASSOCIATED GLYCOPROTEIN (MAG) 
TITLE OF INVENTION: ACTIVITY FOR REGULATING NEURAL GROWTH AND REGENERATION 
FILE REFERENCE: CUNY/003

CURRENT APPLICATION NUMBER: US/10/327,213
CURRENT FILING DATE: 2002-12-20
NUMBER OF SEQ ID NOS: 43 
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 9 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-327-213-9 

Query Match 75.2%; Score 4398.5; DB 16; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I MINIM | | | | | | I I I II : I I :: I I II I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

II I I I I I I II III I I I : I I : I I I! II I I I I II I I I I I I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178



Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II I I I I I I I I I I I I I I I I I II I I I I III I I I I I I : I : I I I I I I II II I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

I I I I I I II II I I I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I II I I I I I I I I I I : : I I : I I : I I : : II : I : I I : I I : I I I III 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

III II I III : : : I I I : I I I I I I I I II I : INN: I I I : I I I : I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

I I I II : I I Mill: I I I I I : I I II II I I : I I I I I I I I I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513

I I I I I I II II I I II I I I I I : I I II : I M II I I II I M I I Ml I I I II I I I : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I II I I : I I II I I I I I I I I I M I I I II I' I I II I I I II I : I I I I I II : I I I I M III 

Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I M I I : I I I II I I I I II I I II I II I : I I I I II I : I II I II I I I : I M M I I I I 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

I I I II II I : I : I I : I II Mill: I I I : I II II I I I I II I I I M I I II I II : I I I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I II : II I : II : I : I I I I I II I : I I I I I II M I I I I : I I I Ml II I : I II II I I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : II I I I II I I I I II : : I MM : : : I I : I II I I II I I I 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: M : M II I I III : I : I : M II I M I I II II I II I MM II I I I M II II II I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I II I I II I II I II I I I I II I I I : : : 11 = 11*1 I = II : I : I 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

III I I : I I II I I I I I I I I I II I I I I I I M II: I I I II I I II I I II II I I II 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015



Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I II I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I M I I I I I I I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 10 
US-10-466-258-9 

; Sequence 9, Application US/10466258 

; Publication No. US20040132096A1 

; GENERAL INFORMATION: 

; APPLICANT: GLAXO GROUP LIMITED 

; TITLE OF INVENTION: ASSAY 

; FILE REFERENCE: P80966 GCW 

; CURRENT APPLICATION NUMBER: US/10/466,258

; CURRENT FILING DATE: 2003-07-15 

; NUMBER OF SEQ ID NOS: 13

; SOFTWARE: PatentIn version 3.0

; SEQ ID NO 9

; LENGTH: 1192

; TYPE: PRT
; ORGANISM: Homo sapiens 
US-10-466-258-9 

Query Match 75.2%; Score 4398.5; DB 16; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I I II I I : II :: I I I I I I I I I I I I I
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III 111:11 : I I I I I M I I I I I I I I I I I I I II I : II : I
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II I I I I I I I I I I I I I I I I I I I I I II III I I I I I I : I : I I I I I II I II I I I I I I I I
Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

I I I I I I I I II I I I I I I I : I II : : I : : ! I I M : I : I :: I I I I I I I I I I

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I II I I I I I I I : : I I : II : I I : : I I : I : I I : I I : II I III

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

III II I III : : : I I I : I I I I I I I I I I I : INN: I I hill : I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

llllhll Mill: MM I : I I II I I M : I I I I I I I I I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I I II M I I I I II I I M II I : I M M I I I II I I II I I I I I I I I II .1 : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : II I I I I II II I I I I I I I I I I I I I II I I I II I : I II I I II : I II I I I I II 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I I I I M I I I I II I I I I I II II I I I M II M I I : I I I I I I II I : I : I I I I I I I 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I II II I I I : I : I I : I II M II I : I I I : I I I I II I I I I I I I II M M M I I M I I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I I : I I I : I I M : I II I I II I : I II I M I I II I I I M I I I : I I I I : I I I I II I . 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I I I I I I I II II : M MM : : : I I M II II I I II I 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

M I : I I II I I III : I : I M II I M II M II I I I II MM II I I I : I I I I I I I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I I I I II I M I I I I I I I I I : : : II M I M I : M : I M 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

III I I : I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I II I I I I II II I I I 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046 

M I I I I I I I I I I I I I I II I II I I I II I I I I I M I I I I I I M I I I I M I I II I I I I I II I I 

Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I I II I I II II I I I I II I I I I I I I I I I I I I I II I I I I I I II I I I II II I II I I I I I I II I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I II II II I I I I : I I M I I II I I I I I I I I I I I M I I I I I I II I II I I II I I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 11 
US-10-408-967-7 

Sequence 7, Application US/10408967 
Publication No. US20040063161A1
GENERAL INFORMATION: 
APPLICANT: Pharmacia & Upjohn Company 
APPLICANT: Yan, Riqiang 
APPLICANT: Lu, Yifeng 

TITLE OF INVENTION: Compositions and Methods of Treating Alzheimer's Disease
FILE REFERENCE: 00925 

CURRENT APPLICATION NUMBER: US/10/408,967
CURRENT FILING DATE: 2003-04-08 
NUMBER OF SEQ ID NOS: 9
SOFTWARE: PatentIn version 3.1
SEQ ID NO 7 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-408-967-7 

Query Match 75.1%; Score 4389.5; DB 12; Length 1192; 

Best Local Similarity 75.8%; Pred. No. 2.9e-213; 

Matches 907; Conservative 104; Mismatches 147; Indels 39; Gaps 20;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I II I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I III 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I II I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I ': I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

11111:11 I I I I I : I I I I I : I I I I I I I I : I I I I I I I I I : I II I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476



Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I II I I I I I I I I II I I I I : I I I I : I I I II I I I I I I I I I I II I I I I I M I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I M I III 
Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I I I I : I I I I I I I I II I I I I I I I I I : I I I I I I I : I I I I I I I I I : I M I I I I I I 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I I I : I : I I : I II Mill: I I I : I I I I I I I I I I I I I I I I I I I II I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I I : I I I : I I : I : I I I I I I I I : I I I I I I I I I I I I I : I I I hi I I I : I I I I I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I I I MINIM: : I : I I I : : : 11 = I I I M 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLINTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: I I : I I I I I I III : I : I : I I I I II I I I I I I I I I II MM II I I I M I I I I I I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I II I I I I I I I I I I I I I I I I I I I I : : : I I : I I M I : I i : I : I 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

III I I : I I I I I I I I I I I I I I I I I I II I I I II: I I I I I I I I I I I I I I I I I II 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I I I I I I I I I I I I II I I I I I II I I I I II I I I I I I I I I I I I II I I I I I II I I I I I M I I I I 
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFPA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I II M M II I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I :* 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 12 
US-10-205-194-164 

; Sequence 164, Application US/10205194 
; Publication No. US20030134301A1 
; GENERAL INFORMATION: 

APPLICANT: Warner-Lambert Company 
; APPLICANT: Lee, Kevin 



APPLICANT: Dixon, Alistair 
APPLICANT: Brooksbank, Robert 
APPLICANT: Pinnock, Robert

TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
FILE REFERENCE: WL-A-018201 

CURRENT APPLICATION NUMBER: US/10/205,194
CURRENT FILING DATE: 2002-07-24
PRIOR APPLICATION NUMBER: GB 0118354.0
PRIOR FILING DATE: 2001-07-27 
NUMBER OF SEQ ID NOS: 177
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 164 
LENGTH: 379 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
FEATURE: 

OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-164 

Query Match 25.9%; Score 1513; DB 14; Length 379;

Best Local Similarity 32.6%; Pred. No. 1.3e-68; 

Matches 379; Conservative 0; Mismatches 0; Indels 784; Gaps 1; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M II I I I I I II I I I I I I M I I I I 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I 
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

I I I I I I I I I I I 

Db 181 AASEPVIPSSA 191 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Db 192 191 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Db 192 191

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Db 192 191 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

Db 192 191 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYWTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 



Db 192 191 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Db 192 191 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Db 192 191 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Db 192 191 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 192 191 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Db 192 191 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 192 191

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 192 — 191 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 192 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 236 

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 237 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 296 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

Db 297 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 356

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I II I I I I I I I I I I I I I I I I I I 

Db 357 NKSVKDAMAKIQAKIPGLKRKAD 379 



RESULT 13 
US-09-893-348-20 

; Sequence 20, Application US/09893348
; Patent No. US20020072493A1
; GENERAL INFORMATION:
; APPLICANT: EISENBACH-SCHWARTZ, Michal
; APPLICANT: COHEN, Irun R.
; APPLICANT: BESERMAN, Pierre
; APPLICANT: MOSONEGO, Alon
; APPLICANT: MOALEM, Gila
; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND
; THEIR USES
; FILE REFERENCE: EIS-SCHWARTZ=2A
; CURRENT APPLICATION NUMBER: US/09/893,348
; CURRENT FILING DATE: 2001-06-28
; PRIOR APPLICATION NUMBER: US 09/314,161
; PRIOR FILING DATE: 1999-05-19
; PRIOR APPLICATION NUMBER: US 09/218,277
; PRIOR FILING DATE: 1998-12-22
; PRIOR APPLICATION NUMBER: PCT/US98/14715
; PRIOR FILING DATE: 1998-07-21
; PRIOR APPLICATION NUMBER: IL 124500
; PRIOR FILING DATE: 1998-05-19
; NUMBER OF SEQ ID NOS: 29
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 20
; LENGTH: 360
; TYPE: PRT
; ORGANISM: Rattus norvegicus
US-09-893-348-20

Query Match 24.1%; Score 1411.5; DB 9; Length 360; 

Best Local Similarity 31.0%; Pred. No. 1.7e-63; 

Matches 360; Conservative 0; Mismatches 0; Indels 803; Gaps 1; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I M I I I I II I I 
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

I I I I I I I I I II I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSV 172 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 



Db 173 172

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Db 173 — 172 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Db 173 172 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Db 173 172

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Db 173 172 



Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Db 173 172 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Db 173 172 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Db 173 172 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Db 173 172

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 173 172

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Db 173 172 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 173 172 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 
Db 173 172 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 173 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 217 

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 218 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 277 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 278 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 337

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163

II I I I I I I I I I II I I I I I I I I II 

Db 338 NKSVKDAMAKIQAKIPGLKRKAD 360



RESULT 14 
US-09-789-386-6 

; Sequence 6, Application US/09789386
; Patent No. US20020010324A1
; GENERAL INFORMATION:
; APPLICANT: MICHALOVICH, DAVID
; APPLICANT: PRINJHA, RABINDER KUMAR
; TITLE OF INVENTION: NOVEL COMPOUNDS
; FILE REFERENCE: GP-30165-C1
; CURRENT APPLICATION NUMBER: US/09/789,386
; CURRENT FILING DATE: 2001-02-21
; PRIOR APPLICATION NUMBER: U.K. 9916898.1
; PRIOR FILING DATE: 1999-07-19
; PRIOR APPLICATION NUMBER: U.K. 9816024.5
; PRIOR FILING DATE: 1998-07-22
; PRIOR APPLICATION NUMBER: US 09/359,208
; PRIOR FILING DATE: 1999-07-22
; NUMBER OF SEQ ID NOS: 6
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 6
; LENGTH: 373
; TYPE: PRT
; ORGANISM: HOMO SAPIENS
US-09-789-386-6

Query Match 20.4%; Score 1191; DB 9; Length 373;

Best Local Similarity 27.8%; Pred. No. 2.4e-52; 

Matches 327; Conservative 12; Mismatches 20; Indels 818; Gaps 7; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I II I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 



Db 179 RGSS 182 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Db 183 182 

Qy 287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEK 346 

Db 183 182 

Qy 347 TMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED 406

Db 183 182 

Qy 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTS 466 

Db 183 182

Qy 467 ENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG 526

Db 183 182



Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586



Db 183 182 

Qy 587 VLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 646

Db 183 182

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 706 

Db 183 182 

Qy 707 EHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS 766 

Db 183 182 

Qy 767 PQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDK 826 

Db 183 182 

Qy 827 IKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886

Db 183 182 

Qy 887 ELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTK 946 

I I : 

Db 183 GSV 185 

Qy 947 EAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFS 1006 

I I II I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 186 WDLLYWRDIKKTGWFGASLFLLLSLTVFS 216 

Qy 1007 IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 1066

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 217 IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 276 

Qy 1067 LGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIY 1126

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I 
Db 277 LGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIY 336

Qy 1127 ERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : 
Db 337 ERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 373 



RESULT 15 
US-09-765-205-6 

; Sequence 6, Application US/09765205
; Patent No. US20020034800A1
; GENERAL INFORMATION:
; APPLICANT: Cao, Li
; TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES
; FILE REFERENCE: 1458.004/200130.449
; CURRENT APPLICATION NUMBER: US/09/765,205
; CURRENT FILING DATE: 2001-01-17
; PRIOR APPLICATION NUMBER: US/09/212,440
; PRIOR FILING DATE: 1998-12-16
; NUMBER OF SEQ ID NOS: 46
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 6
; LENGTH: 373
; TYPE: PRT
; ORGANISM: human
US-09-765-205-6

Query Match 20.4%; Score 1191; DB 9; Length 373; 

Best Local Similarity 27.8%; Pred. No. 2.4e-52; 

Matches 327; Conservative 12; Mismatches 20; Indels 818; Gaps 7; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I I I I II I I I I I I II I I I : I I : : I I I I I I I I I I I I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III llhll : I I I I I I I I I I I I I II I I I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 
I I I 

Db 179 RGSS 182 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Db 183 182 

Qy 287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEK 346

Db 183 182 

Qy 347 TMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED 406

Db 183 182 

Qy 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTS 466

Db 183 182

Qy 467 ENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG 526 

Db 183 182 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586

Db 183 182

Qy 587 VLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 646

Db 183 182 

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 706 



Db 183 — - 182 

Qy 707 EHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS 766 

Db 183 182

Qy 767 PQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDK 826 

Db 183 182 

Qy 827 IKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886 

Db 183 182 

Qy 887 ELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTK 946 

Db 183 GSV 185 

Qy 947 EAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFS 1006 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 186 WDLLYWRDIKKTGWFGASLFLLLSLTVFS 216

Qy 1007 IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 1066

I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 217 IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 276

Qy 1067 LGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIY 1126

I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I : I I II 
Db 277 LGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIY 336

Qy 1127 ERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I : I I I I M I I I I I I I I I I I I I : 
Db 337 ERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 373 



Search completed: September 3, 2004, 16:30:01 
Job time : 122.801 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: September 3, 2004, 15:59:39 ; Search time 73.0975 Seconds 

(without alignments) 
5019.975 Million cell updates/sec 

Title: US-09-830-972-2 
Perfect score: 5848 

Sequence: 1 MEDIDQSSLVSSSTDSPPRP VKDAMAKIQAKIPGLKRKAD 1163 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 1017041 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL_25:*
 1: sp_archea:*
 2: sp_bacteria:*
 3: sp_fungi:*
 4: sp_human:*
 5: sp_invertebrate:*
 6: sp_mammal:*
 7: sp_mhc:*
 8: sp_organelle:*
 9: sp_phage:*
 10: sp_plant:*
 11: sp_rodent:*
 12: sp_virus:*
 13: sp_vertebrate:*
 14: sp_unclassified:*
 15: sp_rvirus:*
 16: sp_bacteriap:*
 17: sp_archeap:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                     %
Result           Query
   No.    Score  Match  Length  DB  ID       Description

     1   5312.5   90.8    1162  11  Q8BGM9   Q8bgm9 mus musculu
     2     5307   90.7    1163  11  Q8K3G8   Q8k3g8 mus musculu
     3   4501.5   77.0    1046  11  Q8BGK7   Q8bgk7 mus musculu
     4   3627.5   62.0     986   4  Q8IUA4   Q8iua4 homo sapien
     5   3299.5   56.4     720  11  Q7TNB7   Q7tnb7 mus musculu
     6     2926   50.0     639  11  Q8K290   Q8k290 mus musculu
     7     2610   44.6     578  11  Q80W95   Q80w95 mus musculu
     8     1416   24.2     375  11  Q8BHF5   Q8bhf5 mus musculu
     9   1314.5   22.5     356  11  Q8BH78   Q8bh78 mus musculu
    10     1304   22.3     357  11  Q8K3G7   Q8k3g7 mus musculu
    11   1283.5   21.9     392   4  Q96B16   Q96b16 homo sapien
    12      878   15.0     184   6  Q7YRW9   Q7yrw9 bos taurus
    13      876   15.0     199  13  Q7T224   Q7t224 gallus gall
    14      844   14.4     179   6  Q9GM33   Q9gm33 macaca fasc
    15      792   13.5     780  11  Q8K4S4   Q8k4s4 mus musculu
    16      788   13.5     780  11  Q8K0T0   Q8k0t0 mus musculu
    17    762.5   13.0     760  13  Q90638   Q90638 gallus gall
    18      737   12.6     214  13  Q7T222   Q7t222 carassius a
    19      700   12.0     643  11  Q8CCU2   Q8ccu2 mus musculu
    20      685   11.7     199   4  Q9BQ59   Q9bq59 homo sapien
    21      671   11.5     267  11  Q63765   Q63765 rattus sp.
    22      669   11.4     208  13  Q90637   Q90637 gallus gall
    23    625.5   10.7     236  11  Q8VBU0   Q8vbu0 rattus norv
    24    625.5   10.7     237  11  Q8C6D5   Q8c6d5 mus musculu
    25    590.5   10.1     221  13  Q7ZUD6   Q7zud6 brachydanio
    26      586   10.0     595   5  Q9VMV9   Q9vmv9 drosophila
    27      532    9.1     224   5  Q9VMW1   Q9vmw1 drosophila
    28      520    8.9     202   5  Q9VMW2   Q9vmw2 drosophila
    29      520    8.9     222   5  Q9VMW4   Q9vmw4 drosophila
    30      520    8.9     234   5  Q9VMW3   Q9vmw3 drosophila
    31      517    8.8    2484   5  Q9U347   Q9u347 caenorhabdi
    32    503.5    8.6    2607   5  Q23187   Q23187 caenorhabdi
    33      377    6.4    2768   5  Q9VC00   Q9vc00 drosophila
    34    347.5    5.9   10578   5  Q8ISF5   Q8isf5 caenorhabdi
    35      344    5.9     107  13  Q7T223   Q7t223 carassius a
    36    343.5    5.9   18519   5  Q8ISF6   Q8isf6 caenorhabdi
    37    343.5    5.9   18534   5  Q8ISF7   Q8isf7 caenorhabdi
    38    342.5    5.9    1417   3  Q871Y7   Q871y7 neurospora
    39      335    5.7    5412   5  Q9W596   Q9w596 drosophila
    40      331    5.7    4900   5  Q9N541   Q9n541 caenorhabdi
    41    328.5    5.6    5327   5  O76891   O76891 drosophila
    42      322    5.5    7962   4  Q10465   Q10465 homo sapien
    43      320    5.5     222   5  Q23188   Q23188 caenorhabdi
    44    317.5    5.4   17352   5  Q95YM2   Q95ym2 procambarus
    45    313.5    5.4    1444   5  Q9VTN2   Q9vtn2 drosophila
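
The "% Query Match" values in the summaries above are consistent with the hit score expressed as a percentage of the perfect score reported in the run header (5848). The following is a minimal Python sketch under that assumption; the function name `query_match_percent` is illustrative and is not part of GenCore.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Return score as a percentage of the perfect score, to one decimal place.

    Assumption: "% Query Match" = 100 * Score / Perfect score.
    """
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 5848  # "Perfect score" from the run header

# Scores of the first three summary rows (Q8BGM9, Q8K3G8, Q8BGK7).
for score in (5312.5, 5307, 4501.5):
    print(score, query_match_percent(score, PERFECT_SCORE))
```

With the three scores shown, the sketch gives 90.8, 90.7 and 77.0, which agree with the listed summary values.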



ALIGNMENTS 



RESULT 1 
Q8BGM9 

ID Q8BGM9 PRELIMINARY; PRT; 1162 AA.
AC Q8BGM9;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE RTN4.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., Schwab M.E.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7;
RA Van der Putten H.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RC STRAIN=129SvcJ7;
RA Van der Putten H., Mir A.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY102284; AAM73506.1;
DR EMBL; AY102286; AAM73511.1;
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 1162 AA; 126613 MW; 855697FBEE11781F CRC64;

Query Match 90.8%; Score 5312.5; DB 11; Length 1162;

Best Local Similarity 91.0%; Pred. No. 9.1e-259; 

Matches 1064; Conservative 37; Mismatches 55; Indels 13; Gaps 6; 
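
The "Best Local Similarity" figure reported for each hit is consistent with the number of matches divided by the total number of aligned columns (matches, conservative substitutions, mismatches and indels). The following is a minimal Python sketch under that assumption; the function name `best_local_similarity` is illustrative and is not part of GenCore.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Return matches as a percentage of all aligned columns, to one decimal.

    Assumption: aligned columns = matches + conservative + mismatches + indels.
    """
    aligned = matches + conservative + mismatches + indels
    if aligned == 0:
        raise ValueError("alignment has no columns")
    return round(100.0 * matches / aligned, 1)


# Counts reported for RESULT 1 (Q8BGM9) in this SPTREMBL search.
print(best_local_similarity(1064, 37, 55, 13))
```

For the RESULT 1 counts the sketch gives 91.0, matching the reported 91.0%.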

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
I I I I I I I I I I I I I M I I I M I M I I I M I I I I M II It : I I I I I I I I I I I I I I I I I I I
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
I I I M II I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I
Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
M I I I I : I : I I I I I I I I I I I I : I I I! I I M I I I II : I I I I I I I I I I I I I I I I I M II
Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESP VGKEDRWSPEKTMDIFNEMQ 355
I : I I I I I II I I I I I I M I I I I I I I I I II : I I I I I I III I : I I I I I I I I I I I I :

Qy 356 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I I I M
Db 357 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGK 416

Qy 416 DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 475
Ml 111:1111 III I II I I I I II I IhlMIII II I I : I I I I I I I I I I I I I I I
Db 417 DSESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKI 476

I I I I I I I I I I I I I I I I I I I II I I : I I I I I I I I I I I I I II I I I : I I I I I I I I I I I I I

Qy 536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I M I I I I I I I I I I I I I I I
Db 537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596

Qy 596 PLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655
I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I : I I M
Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIK 656

Qy 656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715
M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I : I I I I I II I I II I I I I : I 111:11
Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDS 716

Qy 716 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 774
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml : I I M I I I I I : I II I
Db 717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776

Qy 775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834
MINIMI Ml I I 11:11 MMI I I I I I M I I I I I II I II I I M I I I I I : M I I I I I

Qy 835 DSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 894
I I I I I I I II I I II I I I I II I I I I I I I I I I M : II II I I : I I I I : I I I I I I I I I M I
Db 837 DSSPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 893

Qy 895 KNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 954
II I I I I I II I I I I I :: I I II I I : I I II II I I I I : I I I I I I I I I h II I I
Db 894 KNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPS 953

Qy 955 DTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI 1014
I I I I I I I I I : I I II I I I : I I I I I I II I II I I I I I I II I II I I II I I I II I I I I I I I I I II
Db 954 DTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI 1013

Qy 1015 ALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 1074
I I I I || II I I I I I I II I I I I I I I I I I I II M I I I I I II II I I I I I I II II I I II I I I I II
Db 1014 ALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 1073

Qy 1075 KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQID 1134
I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I 1 I I I I I II I I I I I I I I I I I I I I I III
Db 1074 KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQID 1133

Qy 1135 HYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
I I I I I II I I I I II I I I I I II I I II I I I I :
Db 1134 HYLGLANKSVKDAMAKIQAKIPGLKRKAE 1162



RESULT 2 
Q8K3G8 
ID Q8K3G8 PRELIMINARY; PRT; 1163 AA.
AC Q8K3G8;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Nogo-A.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BALB/c;
RA Jin W., Long M., Li R., Ju G.;
RT "Cloning and expression of the mouse Nogo-A protein.";
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY114152; AAM77068.1;
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 1163 AA; 126691 MW; 6B5F362799417EA4 CRC64;



Query Match 90.7%; Score 5307; DB 11; Length 1163;
Best Local Similarity 91.1%; Pred. No. 1.7e-258;
Matches 1066; Conservative 34; Mismatches 56; Indels 14; Gaps 7;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I : I I I I I I I I I I I I II I I I I I
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
I I I I I I I III I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I Mill
Db 60 PAAGLSAVPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
I II I I I I I I I I II I I I II I II I I I I I I I I I I II I I I I I I II I I M II I I I I I I I I I I
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
I II I I I II I I I I I I I I I I LI I I I I I I I I I I I I I I I II I I I I I I I I I I II II I I I I I I I
Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236



241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

I I M I I : I :! I II I I I I I ! I I : I I I I I I I I I I I I I : I I I I M I I I I I I I I I I I I M I 
237 GNLSAVASTEGTIEETLNEAS RELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296 

301 AI LVEN T KE E VI VRS KD KE D LVC S AALH S P Q E S P VGKEDRWSPEKTMDIFNEMQ 355 

I : I I I I I I I I I I I I I I I I I I I II I I II I : I I I II I I II I : I I I I I I I I I I I I : 

297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKWKEDGVMSPEKTMDIFNEMK 356 

356 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415 

I I I I II I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I I II II 

357 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKSHGK 416

416 DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 475 

III 111:11111111 III I I I I I I I I I I I I I I I I II I I : I I I II I I I I I I I I I I 

417 DSESRNENASFPSTPELVKDGSRAYITCDSFTSATESTTVANIFPVLEDHTSENKTDEKKI 476 

476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535 

I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I I I I I I I II I I I : I I II I I I I I I I II 

477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYWTDNLSKVTEAWATMPEGLTPDLVQEA 536 

536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I M 

537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596 

596 PLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655

I I I I I I I I I I I I III I I I I I I I I I I I II I I I I I I I I I I I I : I I I I M M 

597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDAKEEIK 656 

656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEEAELVEDS 715 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 111111111111111:1 I I I : I I 

657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEKSVPDHCELVDDS 716 

716 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 774

I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I III : I I I I I I I I I : I I I I 

717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776

775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 
777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836

835 DSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 894

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I : I I II : I I I I I I I I I I I I 

837 DSSPIEIIDEFPTFVSAKDDSP KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 893 

895 KNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 954 

II I I I I I I I I M I I : : I I I I II : I I I I II I I I I : I I I I I I I I I I : I I I I 

894 KNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPS 953 

955 DTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGW-FGASLFLLLSLTVFSIVSVTAY 1013 

I I I I I I I I I : I I I I I II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
954 DTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWYFGASLFLLLSLTVFSIVSVTAY 1013

1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I 

1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073 

1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQI 1133



Db 1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQI 1133

Qy 1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I II I I I I I I I I I I I I I I I I I I I I I : 
Db 1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAE 1163



RESULT 3
Q8BGK7

ID Q8BGK7 PRELIMINARY; PRT; 1046 AA.

AC Q8BGK7;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE RTN4.

GN RTN4.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., van der Putten H., Schwab M.E.;

RT "Genomic Structure and Functional Characterization of the Promoter

RT Structures of Human and Mouse Nogo/Rtn-4.";

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., Schwab M.E.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7;

RA Van der Putten H.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [4]

RP SEQUENCE FROM N.A.

RC STRAIN=129SvcJ7;

RA Van der Putten H., Mir A.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AY102280; AAM73502.1;

DR EMBL; AY102286; AAM73507.1;

DR MGD; MGI:1915835; Rtn4.

DR GO; GO:0005783; C:endoplasmic reticulum; IDA.

DR GO; GO:0007399; P:neurogenesis; IDA.

DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1.

DR PROSITE; PS50845; RETICULON; 1.

SQ SEQUENCE 1046 AA; 114221 MW; 8CE2E2238ED51222 CRC64;



Query Match 77.0%; Score 4501.5; DB 11; Length 1046; 

Best Local Similarity 86.9%; Pred. No. 4.8e-218; 

Matches 910; Conservative 40; Mismatches 68; Indels 29; Gaps 4;



Qy 143 PPPAGASPLAEPAAPPSTPA--------------------APKRRGSGSVDETLFALPAA 182

|| || |: |: | | I :: I I I I I I I I I
Db 3 PPLAGGGQKGGAASEAWVPSLFVGVSGSTCTAAKSLVPIPARSSRLSAARNETLFALPAA 62

Qy 183 SEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGN 242

I I I I I I I I I M I I I I I I I I I I I I II M I I I I I I I I I II I I I I I I I M I I I I M I I II I
Db 63 SEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYLGN 122

Qy 243 LSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI 302

I I I I : I : I I I I I I I I I I I I : I I I I I II III I I I : I I I I I I I I I I I I M I I II I I II :
Db 123 LSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGESAM 182

Qy 303 LVENTKEEVIVRSKDKEDLVCSAALHSPQESP-----VGKEDRWSPEKTMDIFNEMQMS 357

I I I I I I I I I I I I I I I I I I I I II I I I I : III I I I IN I : I I I M I II I I I I : I I
Db 183 LVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMKMS 242

Qy 358 WAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDS 417

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II M : I I I I I : I I I I I I II I I I II
Db 243 VVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGKDS 302

Qy 418 EGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEE 477

I Mhllll III III I II I I I I M:IMIM II I I : I I I I I I I I M I I I I I I I
Db 303 ESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKIEE 362

Qy 478 RKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACE 537

I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I II I I I I : I I I I I I I I I I I I I I I
Db 363 RKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEACE 422

Qy 538 SELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPL 597

I I I I I I I I I I I I I I I I I II I I I I I I I I II : I I I I I I I I I I I I I II I I I I I I I I I I I I I I I
Db 423 SELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEAPL 482

Qy 598 657

I M I I I MM Ml MM I I I II I II I I I I I I I I I I I II : I II I : I I I I II
Db 483 542

Qy 658 ESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSP 717

I I I I II M I I I I II II II I II I I I I I I I I M M I II I I I I II I I M I M I I I M II I
Db 543 ESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDSSP 602

Qy 718

II I I II I II M II I I I I I I I I I I I I I I I I M I I I I I I I I II : II II I I I I I : I I I I I I
Db 603 ESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPYLE 662

Qy 777 SFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDS 836

|| I I II I II I I II I M II I I I II I II I I M II I II I II I II I M I I II M I I II I I II
Db 663 SFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFSDS 722

Qy 837 SPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKN 896

I || M I I I II I I I II I II I I M I I I II II M II II I : I II I M I II II I I I II I I I
Db 723 SPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKN 779

Qy 897 IYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDT 956

Mill I I II I I I I I I M I : I I I II I I II I M I I I I I I I I I : M I II I
Db 780 TYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPSDT 839

Qy 957 EKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAL 1016

I I I I I M : I I I I I I I : I I I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I > I I I I > I M I
Db 840 EKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAL 899

Qy 1017 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHWSTIKE 1076

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I I I I M I I
Db 900 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 959

Qy 1077 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHY 1136

I I I I I I I I I I II I I I I II I I II I I I I I I I M M I I M I I M I I I I I I I I I I I I I I I I I I
Db 960 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHY 1019

Qy 1137 LGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I I I I I II I I I I I :
Db 1020 LGLANKSVKDAMAKIQAKIPGLKRKAE 1046



RESULT 4
Q8IUA4

ID Q8IUA4 PRELIMINARY; PRT; 986 AA.

AC Q8IUA4;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE RNT4 (RTN4).

GN RTN4.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RA Oertle T., van der Putten H., Schwab M.E.;

RT "Genomic Structure and Functional Characterization of the Promoter

RT Structures of Human and Mouse Nogo/Rtn-4.";

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RA Oertle T., Schwab M.E.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A.

RA Van der Putten H.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [4]

RP SEQUENCE FROM N.A.

RC TISSUE=Testis;

RX MEDLINE=22376540; PubMed=12488097;

RA Oertle T., van der Putten H., Schwab M.E.;

RT "Genomic Structure and Functional Characterization of the Promoter

RT Structures of Human and Mouse Nogo/Rtn-4.";

RL J. Mol. Biol. 325:299-323(2003).

DR EMBL; AY102285; AAM64244.1;

DR EMBL; AY123245; AAM64249.1;

DR EMBL; AY123246; AAM64250.1;

DR EMBL; AY123247; AAM64251.1;

DR EMBL; AY123248; AAM64252.1;

DR EMBL; AY123249; AAM64253.1;

DR EMBL; AY123250; AAM64254.1; -.

DR GO; GO:0005783; C:endoplasmic reticulum; IEA.

DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1.

DR PROSITE; PS50845; RETICULON; 1.

SQ SEQUENCE 986 AA; 108449 MW; 0CDE8F647036415A CRC64;

Query Match 62.0%; Score 3627.5; DB 4; Length 986; 

Best Local Similarity 75.4%; Pred. No. 3.9e-174; 

Matches 745; Conservative 96; Mismatches 126; Indels 21; Gaps 13; 

Qy 195 MDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIE 254 

Ml | | | | M : I : I I I I I I I I M I I I I I I I M I I M I I I I I I I I I I I I : I I I : : 
Db 1 MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 60

Qy 255 ETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVR 314

| : : M | | | : I : I : : I I I I I I I I I I I I I II I I III I I I - I I =11 = 11 = 

Db 61 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 120 

Qy 315 SKDKED-LVCSAALHSPQESPVG-----KEDRWSPEKTMDIFNEMQMSWAPVREEYAD 368

:||:|: M : ||: II I III III II I III :::| IIMIIIII 

Db 121 NKDEEEKLVSNNILHNQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 180

Qy 369 FKPFEQAWEVKDTYEGSRDVLAA----RANVESKVDRKCLEDSLEQKSLGKDSEGRNEDA 424

| | | | | : I I || I : I I I : I I I : I : I I I I I : I INN: MM Ml 

Db 181 FKPFERWEVKDSKEDS-DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDT 239

Qy 425 SFPSTPEPVKDSSRAYITCASFT-SATESTTANTFPLLEDHTSENKTDEKKIEERKAQII 483

M I I II I Ml I MUM I Mill I I I I I I I II II M M II II M II I : 
Db 240 SFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIV 299 

Qy 484 TEK-TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNE 542 

|| | || | I M I II II II I I I M I II I M II I I M I II II I I I II I I M II I II I 
Db 300 TEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNE 359 

Qy 543 ATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLP 602 

M II I II I II M II II II =111111 I M I M I M M II I II I II I I II I II II I M 
Db 360 VTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVP 419 

Qy 603 SAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKAL-GTKEGIKEPESFN 661 

II II II M II II II I I M M II II I II II I II II M M I : I M MIIMI 
Db 420 SAGASVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPENIN 478

Qy 662 AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 721 

I I Ml II II M M I I I I I I I I I I I II : I I I I M I I M I I : MM: II I M II 

Db 479 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 538 

Qy 722 VDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVAQHK-EERLSASPQELGKPYLESFQ 779

| M II I II II M II I M I I I M I II I I I I : : - : M M I I I I M I II M I : 
Db 539 VDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFK 598 

Qy 780 PNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSP 838

M M II : :: I I Ml I I I Mill M I M M I II III M M M I I II I II 
Db 599 LSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSP 658 



Qy 839 IEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI 897

I | | | II I I I M M II I II M I M M I I I I II I I M I I II I II I I I I II
Db 659 IEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNI 718



Qy 898 YPK--DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSD 955

|| ::: ||:||:| |: II : I :|||| I I : I I I I I I I I I I I I I I I I 
Db 719 QPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSD 778 

Qy 956 TEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 1015

I I I M I I M : I I II II I I I I I M I I I I I I I I I I I I I I M I I I II I I I I I I I I I M I II 
Db 779 TEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 838 

Qy 1016 LALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIK 1075

I | | | | | | I I I I I I I I I I I I I I I I I I I I I I I M I IN I I I I I I I I I I I I I I I I I I I I I M 
Db 839 LALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIK 898 

Qy 1076 ELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDH 1135

I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I : I I I i I I I I I I I I 

Db 899 ELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDH 958

Qy 1136 YLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I II : I I I I I I I I I I I I M I I I I I : 
Db 959 YLGLANKNVKDAMAKIQAKIPGLKRKAE 986 



RESULT 5 
Q7TNB7 

ID Q7TNB7 PRELIMINARY; PRT; 720 AA.

AC Q7TNB7; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 



RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RA Strausberg R. ; 

RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC056373; AAH56373.1; -.

KW Hypothetical protein. 

SQ SEQUENCE 720 AA; 77435 MW; 80AB78728F16EAB2 CRC64;



Query Match 56.4%; Score 3299.5; DB 11; Length 720; 

Best Local Similarity 90.3%; Pred. No. 7.9e-158; 

Matches 654; Conservative 22; Mismatches 39; Indels 9; Gaps 4; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I I I I I II I I I I I I 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

I I I II I I I III 11111111111111111111111111 I I I I I I I I I I I I II I I I I I 
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

I I I I I I : I : I I I I II I I I I I I : I I I I I I I I M I I I : I I I I I I I I I I I I I I I I I I I I I 
Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESP-----VGKEDRWSPEKTMDIFNEMQ 355

I : I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I III I : I M I I I I I I II I : 

Db 297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKWKEDGVMSPEKTMDIFNEMK 356 

Qy 356 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I : I I I I I : M I I I I I I I II 
Db 357 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGK 416

Qy 416 DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 475

III I I I : I I I I III III I I I I I I I I I : I I I I I I II I I : I I I I I I I I I I I I I I I 
Db 417 DSESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKI 476 

Qy 476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535 

I I I I I I I I I II I II I I I I I I I I I : M I I I I II It I I I I I I I I : I I I I I I I I I I I I I 
Db 477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYWTDNLSKVTEAWATMPEGLTPDLVQEA 536 

Qy 536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595 

I I I I I I I I I II I I I I I I I I I I II I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596 

Qy 596 PLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655 

I I I II I I I I I I I III MM I Mill I I I I I I I I I I I I II I : I II I Ml M 



Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIK 656



Qy 656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715 

I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I : I I I I I I I I I I I I I I I : I I II : I I 
Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDS 716 

Qy 716 SPES 719 

MM, 
Db 717 SPES 720 



RESULT 6 
Q8K290 

ID Q8K290 PRELIMINARY; PRT; 639 AA.

AC Q8K290; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Hypothetical protein. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC032192; AAH32192.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW; 309A19DA37603F11 CRC64; 



Query Match 50.0%; Score 2926; DB 11; Length 639; 

Best Local Similarity 91.6%; Pred. No. 4e-139; 

Matches 588; Conservative 20; Mismatches 30; Indels 4; Gaps 2;

Qy 523 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEA 582 

II I I II I II I I I I I I I I II I I I I I I II II I I I I I I I I I M II I I : I I I I I II I M I II II 

Db 1 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEA 60 

Qy 583 TPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMN 642

II I I II I II I I I I II I I II II Mil III II I I I II I I I I I I I I I I I II II II I : 
Db 61 TPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMS 120 



Qy 643 VALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFE 702

I II I II I I I I I I I I I I II I I I I I I I I I I I I I I II I II I I I I : I II I I I I I II I 

Db 121 VALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFE 180 

Qy 703 KSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EE 761

MUM I I I : I I I I II I I I I I I II I I I I M I I II I I I I I I I II I I II II I II III : I 
Db 181 KSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKE 240 



Qy 762 RLSASPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLS 821 

I I I I I I I I : I I I I I I I I I I I I I II I I I I I : I I I I I I I I I. I I I I I I I I I I I I I I I M I I 
Db 241 RLSASPQEVGKPYLESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLS 300 

Qy 822 SKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGAD 881 

I I I I I : I I I I II I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I : II I I I I : I I I I : 
Db 301 SKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGAN 357

Qy 882 SLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKS 941 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I : I I I 
Db 358 SLPCSELPCDLSFKNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKP 417 

Qy 942 KSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLS 1001 

I I I I I I I : I I I I I I I I I I I I I : I I I I II I : I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 418 KVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASLFLLLS 477 

Qy 1002 LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQK 1061

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 478 LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQK 537

Qy 1062 YSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFS 1121 

I I 1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 538 YSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFS 597

Qy 1122 IPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I : 
Db 598 IPVIYERHQAQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAE 639



RESULT 7
Q80W95

ID Q80W95 PRELIMINARY; PRT; 578 AA.

AC Q80W95;

DT 01-JUN-2003 (TrEMBLrel. 24, Created)

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Nogo-A (Fragment).

GN NOGO-A.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RA Tozaki H., Hirata T.;

RT "The partial sequence of mouse nogo-A cDNA clone#4109."

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AB073672; BAC75974.1;

DR GO; GO:0005783; C:endoplasmic reticulum; IEA.

DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1.

DR PROSITE; PS50845; RETICULON; 1.

FT NON_TER 1 1

SQ SEQUENCE 578 AA; 63696 MW; 832670C171E4AC61 CRC64;



Query Match 44.6%; Score 2610; DB 11; Length 578;

Best Local Similarity 90.7%; Pred. No. 2.7e-123;

Matches 527; Conservative 18; Mismatches 32; Indels 4; Gaps 2;



Qy 584 PSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNV 643

I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I II I : I
Db 1 PSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSV 60

Qy 644 ALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEK 703

III II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I
Db 61 ALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEK 120

Qy 704 SVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EER 762

I I I : I I I I : I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I III : I I
Db 121 SVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKER 180

Qy 763 LSASPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSS 822

1111111:1111111111111 j I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 181 LSASPQEVGKPYLESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSS 240

Qy 823 KEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADS 882

I I II : I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I : I I I I I I : I I I I : I
Db 241 KEDKMKESETFSDSSPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGANS 297

Qy 883 LPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSK 942

III I I I I I I I I I I I I I I I I I I I I I I :: I I I I I : I I I I I I I I I I : I I I I
Db 298 LPCSELPCDLSFKNTYPKDEAHVSDEFSKSRSSVFKVPLLLPNVSALESQIEMGNIVKPK 357

Qy 943 SLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSL 1002

I I I I I I : II I I I I I I I I I I I : I I I I I I I : I I I I I I I I I I I I I I I I I II I I I II I I I I I I
Db 358 VLTKEAEEKLPSDTEKEDRSLTAVXSAELNKTSWDLLYWRDIKKTGWFGASLFLLLSL 417

Qy 1003 TVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKY 1062

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 418 TVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKY 477

Qy 1063 SNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMV^FTWGA 1122

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I
Db 478 SNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSI 537

Qy 1123 PVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I :
Db 538 PVIYERHQAQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAE 578



RESULT 8
Q8BHF5

ID Q8BHF5 PRELIMINARY; PRT; 375 AA.

AC Q8BHF5;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE RTN4.

GN RTN4.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., van der Putten H., Schwab M.E.;

RT "Genomic Structure and Functional Characterization of the Promoter

RT Structures of Human and Mouse Nogo/Rtn-4.";

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., Schwab M.E.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7;

RA Van der Putten H.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [4]

RP SEQUENCE FROM N.A.

RC STRAIN=129SvcJ7;

RA Van der Putten H., Mir A.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AY102282; AAM73504.1; -.

DR EMBL; AY102286; AAM73509.1; -.

DR MGD; MGI:1915835; Rtn4.

DR GO; GO:0005783; C:endoplasmic reticulum; IDA.

DR GO; GO:0007399; P:neurogenesis; IDA.

DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1.

DR PROSITE; PS50845; RETICULON; 1.

SQ SEQUENCE 375 AA; 40300 MW; 23D9EB19BE671AE6 CRC64;



Query Match 24.2%; Score 1416; DB 11; Length 375; 

Best Local Similarity 31.6%; Pred. No. 1.7e-63; 

Matches 367; Conservative 2; Mismatches 6; Indels 788; Gaps 4;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I II I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

I I I I I I I I I I I 

Db 177 AASEPVIPSSA— 187 



Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 
Db 188 187 



Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Db 188 187

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Db 188 187

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Db 188 187

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Db 188 187

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Db 188 187

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Db 188 187

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Db 188 187

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Db 188 187

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Db 188 187

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Db 188 187

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 188 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 232

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I
Db 233 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 292

Qy 1081 FLVDDLVDSLKFAVLMWFTWGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

II I I I II I I I I I I I I I I I I I I I I I II I I I II I I M I I I I II I I I I I I I I I I I I I I I I I I
Db 293 FLVDDLVDSLKFAVLMWFTWGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLA 352

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I II I I I I I I I II :
Db 353 NKSVKDAMAKIQAKIPGLKRKAE 375



RESULT 9
Q8BH78

ID Q8BH78 PRELIMINARY; PRT; 356 AA.

AC Q8BH78;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE RTN4.

GN RTN4.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., van der Putten H., Schwab M.E.;

RT "Genomic Structure and Functional Characterization of the Promoter

RT Structures of Human and Mouse Nogo/Rtn-4.";

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., Schwab M.E.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7;

RA Van der Putten H.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [4]

RP SEQUENCE FROM N.A.

RC STRAIN=129SvcJ7;

RA Van der Putten H., Mir A.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AY102281; AAM73503.1;

DR EMBL; AY102286; AAM73508.1;

DR MGD; MGI:1915835; Rtn4.

DR GO; GO:0005783; C:endoplasmic reticulum; IDA.

DR GO; GO:0007399; P:neurogenesis; IDA.

DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1.

DR PROSITE; PS50845; RETICULON; 1.

SQ SEQUENCE 356 AA; 38403 MW; 4366C03BA9630B56 CRC64;



Query Match 22.5%; Score 1314.5; DB 11; Length 356; 

Best Local Similarity 29.9%; Pred. No. 2e-58; 

Matches 348; Conservative 2; Mismatches 6; Indels 807; Gaps 4;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 



Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120



1 1 1 1 1 1 1 1 III I II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

I I I I I I I I I I I I I I I I I II II I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSV 168

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Db 169 168 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Db 169 168

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360

Db 169 168

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Db 169 168

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Db 169 168

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Db 169 168

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Db 169 168

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Db 169 168

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Db 169 168

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Db 169 168

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Db 169 168

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Db 169 168

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960



Qy
Db

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
Db  214 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 273

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
Db  274 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLA 333

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
Db  334 NKSVKDAMAKIQAKIPGLKRKAE 356



RESULT 10 

Q8K3G7 

ID Q8K3G7 PRELIMINARY; PRT; 357 AA.
AC Q8K3G7;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Nogo-B.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BALB/c;
RA Jin W., Li R., Long M., Shen J., Ju G.;
RT "Cloning and expression of the mouse Nogo-B protein.";
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY114153; AAM77069.1;
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64;

Query Match 22.3%; Score 1304; DB 11; Length 357;
Best Local Similarity 29.9%; Pred. No. 6.6e-58;
Matches 348; Conservative 2; Mismatches 6; Indels 808; Gaps 5;

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        ||||||||||||| |||||||||||||||||||||||||:|| |||||||||||||||||
Db    1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
        |||||||| ||| |||||||||||||||||||||||||| |||||||||||||| |||||
Db   60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
        |||||||||||||||||||  || ||||||||||||||||||||||||||||
Db  119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSV-------- 168

Qy  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
Db  169 ------------------------------------------------------------ 168

Qy  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
Db  169 ------------------------------------------------------------ 168

Qy  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
Db  169 ------------------------------------------------------------ 168

Qy  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
Db  169 ------------------------------------------------------------ 168

Qy  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
Db  169 ------------------------------------------------------------ 168

Qy  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
Db  169 ------------------------------------------------------------ 168

Qy  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
Db  169 ------------------------------------------------------------ 168

Qy  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
Db  169 ------------------------------------------------------------ 168

Qy  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
Db  169 ------------------------------------------------------------ 168

Qy  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
Db  169 ------------------------------------------------------------ 168

Qy  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
Db  169 ------------------------------------------------------------ 168

Qy  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
Db  169 ------------------------------------------------------------ 168

Qy  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
Db  169 ------------------------------------------------------------ 168

Qy  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVV-FGASLFLLLSLTVFSIVSVTAYIALALL 1019
                       |||||||||||||||| ||||||||||||||||||||||||||||
Db  169 ---------------VVDLLYWRDIKKTGVVYFGASLFLLLSLTVFSIVSVTAYIALALL 213

Qy 1020 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRR 1079
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRR 273

Qy 1080 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGL 1139
        ||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||
Db  274 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGL 333

Qy 1140 ANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||||||||:
Db  334 ANKSVKDAMAKIQAKIPGLKRKAE 357



RESULT 11 
Q96B16 

ID Q96B16 PRELIMINARY; PRT; 392 AA.
AC Q96B16;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Hypothetical protein (RTN4).
GN RTN4.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Kidney;
RA Strausberg R.;
RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RA Oertle T., Schwab M.E.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RA Van der Putten H.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP SEQUENCE FROM N.A.
RX MEDLINE=22376540; PubMed=12488097;
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL J. Mol. Biol. 325:299-323(2003).
DR EMBL; BC016165; AAH16165.1; -.
DR EMBL; AY102285; AAM64242.1; -.
DR EMBL; AY102278; AAM64247.1; -.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Hypothetical protein.
SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E839E58AD CRC64;

Query Match 21.9%; Score 1283.5; DB 4; Length 392;
Best Local Similarity 29.4%; Pred. No. 8.1e-57;
Matches 346; Conservative 11; Mismatches 20; Indels 801; Gaps 7;

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        |||:||| ||||| |||||| |||||||| |||||| |||||:||::|||||||||||||
Db    1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy   61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
        |||||||| ||  ||| |||:|| :| |||||||||||||| ||||||||: || :   |
Db   59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy  116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
        |||   |||| ||||||||||||||||||||  || ||         |||||||||||||
Db  119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy  167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225
        || ||||||||||||||||||| |||
Db  179 RGSSGSVDETLFALPAASEPVIRSSA---------------------------------- 204

Qy  226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285
Db  205 ------------------------------------------------------------ 204

Qy  286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPE 345
Db  205 ------------------------------------------------------------ 204

Qy  346 KTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLE 405
Db  205 ------------------------------------------------------------ 204

Qy  406 DSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHT 465
Db  205 ------------------------------------------------------------ 204

Qy  466 SENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPE 525
Db  205 ------------------------------------------------------------ 204

Qy  526 GLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPS 585
Db  205 ------------------------------------------------------------ 204

Qy  586 PVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVAL 645
Db  205 ------------------------------------------------------------ 204

Qy  646 KALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSV 705
Db  205 ------------------------------------------------------------ 204

Qy  706 PEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSA 765
Db  205 ------------------------------------------------------------ 204

Qy  766 SPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKED 825
Db  205 ------------------------------------------------------------ 204

Qy  826 KIKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 885
Db  205 ------------------------------------------------------------ 204

Qy  886 LELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLT 945
Db  205 ------------------------------------------------------------ 204

Qy  946 KEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVF 1005
                                      ||||||||||||||||||||||||||||||
Db  205 ------------------------------VVDLLYWRDIKKTGVVFGASLFLLLSLTVF 234

Qy 1006 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1065
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  235 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 294

Qy 1066 ALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVI 1125
        |||||| |||||||||||||||||||||||||||||||||||||||||||||||||:|||
Db  295 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 354

Qy 1126 YERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        ||||| |||||||||||:|||||||||||||||||||:
Db  355 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 392



RESULT 12 
Q7YRW9 

ID Q7YRW9 PRELIMINARY; PRT; 184 AA.
AC Q7YRW9;
DT 01-OCT-2003 (TrEMBLrel. 25, Created)
DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE RTN4w (Fragment).
GN RTN4.
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
OC Bovidae; Bovinae; Bos.
OX NCBI_TaxID=9913;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=22715887; PubMed=12832288;
RA Oertle T., Klinger M., Stuermer C.A., Schwab M.E.;
RT "A reticular rhapsody: phylogenic evolution and nomenclature of the
RT RTN/Nogo gene family.";
RL FASEB J. 17:1238-1247(2003).
DR EMBL; AY164744; AAP47319.1;
FT NON_TER 1 1
SQ SEQUENCE 184 AA; 20671 MW; DE990E03BBAF84A1 CRC64;

Query Match 15.0%; Score 878; DB 6; Length 184; 

Best Local Similarity 96.2%; Pred. No. 6.9e-37; 

Matches 177; Conservative 3; Mismatches 4; Indels 0; Gaps 0; 

Qy  980 LYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSD 1039
        ||||||||||||||  ||||||||||||||||||||||||||||||||||||||||||||
Db    1 LYWRDIKKTGVVFGXXLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSD 60

Qy 1040 EGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVF 1099
        |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Db   61 EGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVF 120

Qy 1100 TYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLK 1159
        ||||||||||||||||||||||:|||||||| |||||||||||:||||||||||||||||
Db  121 TYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 180

Qy 1160 RKAD 1163
        |||:
Db  181 RKAE 184
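
The "Best Local Similarity" figure printed with each result is consistent with the number of identical positions divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). The report does not state this formula; it is inferred from the statistics lines of the results listed here. A minimal Python sketch, with a hypothetical function name, that reproduces the printed values:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percentage of alignment columns that are identical matches.

    Assumption: every alignment column is counted exactly once as a
    match, a conservative substitution, a mismatch or an indel.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Statistics copied from the result blocks in this report.
print(best_local_similarity(177, 3, 4, 0))        # Q7YRW9
print(best_local_similarity(348, 2, 6, 808))      # Q8K3G7
print(best_local_similarity(346, 11, 20, 801))    # Q96B16
print(best_local_similarity(242, 105, 260, 138))  # Q8K4S4
```

Under this reading, the long all-gap stretches in the Nogo-B alignments count as indel columns, which is why those hits show about 30% similarity despite being nearly identical to the query wherever residues are actually aligned.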



RESULT 13 
Q7T224 
ID Q7T224 PRELIMINARY; PRT; 199 AA.
AC Q7T224;
DT 01-OCT-2003 (TrEMBLrel. 25, Created)
DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE RTN4-C.
GN RTN4.
OS Gallus gallus (Chicken).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;
OC Gallus.
OX NCBI_TaxID=9031;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=22715887; PubMed=12832288;
RA Oertle T., Klinger M., Stuermer C.A., Schwab M.E.;
RT "A reticular rhapsody: phylogenic evolution and nomenclature of the
RT RTN/Nogo gene family.";
RL FASEB J. 17:1238-1247(2003).
DR EMBL; AY164737; AAP47312.1;
SQ SEQUENCE 199 AA; 22293 MW; 07CF4E4EF2723251 CRC64;



Query Match 15.0%; Score 876; DB 13; Length 199; 

Best Local Similarity 90.6%; Pred. No. 9.7e-37; 

Matches 173; Conservative 10; Mismatches 8; Indels 0; Gaps 0;



Qy 973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032
I II I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I 



Db 9 KDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 68

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

|||||||||||||:||:||:|:||||: I I I = I MINIM Ml 

Db 69 QAIQKSDEGHPFRAYLESDVAVSEDLIQKYSSVVLGHINGTVKELRRLFLVDDLVDSLKF 128

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152

M I I I I I I I I I I I I I I I I I I I M I I I I I I : I I I I I M I III Ihlllllllll 

Db 129 AVLMWVFTYVGALFNGLTLLII^ 188 

Qy 1153 AKIPGLKRKAD 1163

I I I I I I I I I : 
Db 189 AKIPGLKRKTE 199 



RESULT 14 
Q9GM33 

ID Q9GM33 PRELIMINARY; PRT; 179 AA.
AC Q9GM33;
DT 01-MAR-2001 (TrEMBLrel. 16, Created)
DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Hypothetical protein.
OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
OC Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Brain;
RA Osada N., Hida M., Kusuda J., Tanuma R., Iseki K., Hirai M., Terao K.,
RA Suzuki Y., Sugano S., Hashimoto K.;
RT "Isolation of full-length cDNA clones from macaque brain cDNA
RT libraries.";
RL Submitted (OCT-2000) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AB049853; BAB16739.1;
DR GO; GO:0005783; C:endoplasmic reticulum; IEA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Hypothetical protein.
SQ SEQUENCE 179 AA; 19949 MW; 5F8CD4383FEE9E02 CRC64;



Query Match 14.4%; Score 844; DB 6; Length 179; 

Best Local Similarity 95.5%; Pred. No. 3.4e-35; 

Matches 171; Conservative 6; Mismatches 2; Indels 0; Gaps 0;

Qy  985 IKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1044
        :|||||||||||||||||||||||||||||||||||||||||||:|||||||||||||||
Db    1 MKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYRGVIQAIQKSDEGHPF 60

Qy 1045 RAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104
        ||||||||||||||||||||||||||| ||||||||||||||||||:|||||||||||||
Db   61 RAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLRFAVLMWVFTYVGA 120

Qy 1105 LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db  121 LFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 179



RESULT 15 
Q8K4S4 

ID Q8K4S4 PRELIMINARY; PRT; 780 AA.
AC Q8K4S4;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Reticulon 1A.
GN RTN1 OR RTN-1A.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=ICR; TISSUE=Brain;
RA Hirata T., Nomura T., Takagi Y., Sato Y., Tomioka N., Fujisawa H.,
RA Osumi N.;
RT "Mosaic development of the olfactory cortex with Pax6-dependent and -
RT independent components.";
RL Submitted (NOV-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AB074899; BAB96551.1;
DR MGD; MGI:1933947; Rtn1.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR InterPro; IPR001951; Histone_H4.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS00047; HISTONE_H4; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 780 AA; 83504 MW; 545F5638C576A069 CRC64;

Query Match 13.5%; Score 792; DB 11; Length 780;
Best Local Similarity 32.5%; Pred. No. 1.1e-31;
Matches 242; Conservative 105; Mismatches 260; Indels 138; Gaps 27;

Qy 517 EAAVSNMPEGLTPDLVQEACESELNEATG TKIAYETKVDLVQTSEAIQ-ESLY 568 

I I : I | | : : | | : | : : I : I : : I I : : 

Db 76 ETASTGM--AAVPDALDHSPSSTLKDGEGACYTSLISDVCYPPREDSAYFTGILQKENGH 133

Qy 569 PTAQLCPSFEEAEATPSPVLPDIVMEAP LNS LLPSAGASVVQPSVSPLEAPP 620

I I I I I I I I I I : : I 1:1 : I : I : II: 

Db 134 ITTSESP--EEPE-TPGPSLPEVPGMEPQGLLSSDSGIEMTPAESTEVNKILADPLDQMK 190

Qy 621 PVSYDSIKLE PENPPPYEEAMNVALKALGTK EGIKEPESFNAAV 664 

: I I : I I I : : I I I : I I : : I II 

Db 191 AEAYKYIDITRPQEAKGQEEQHPGLEDKDLDFKDKGTEVSTKAEGVRAPNQ-PAPVEGKL 249 

Qy 665 QETEAPYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKSVPEHAELV 712 

: I II I I I : : : : I : I : I I : : : : I I : I 

Db 250 IKDHLFEESTFAPYIDELSD--EQHRVSLVTAPVKITLTEIEPPLMTATQETIPEKQDLC 307

Qy 713 EDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK EERLSASP 767 

I I : : I I : I : : I I I I : I I : I : : I 



Db 308 LKPSPDTVPTVTVSE PEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIK 361



Qy 768 QELGKPYLESFQPN LHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSS 822 

: I I : I I : :ll: ' J ' 

Db 362 EAKGLSYETTESPRPVGQVADKPKTKTRSGLPTIPS PLDQEASS 405

Qy 823 KEDKIKESETFSDSSPIEIIDEFPT FVSAKDDSPKLAK-EYTDLEVSDKSEIAN 875 

I | | |: |: |,|: II I I :h I "I: " 

Db 406 AESGDSEIELVSE-DPMASEDALPSGYVSFGHVSGPPPSPASPSIQYSILREEREAEL-- 462

Qy 876 IQSGADSLPCLELPCDL-SFKNIYPKDEVH VSDEFSENRSSVSKASISPSNVS 927 

II : I II I Ml I I I : : I I : 

Db 463 DSELIIE-SCDASSASEESPKREQDSPPMKPGALDAI REETGSRATEERAPSHQG 516



Qy 928 ALEPQTEMGSIVKSKSLTKEAEKKL PSDTEKEDRSLSAVLSAE 970

:|| II : :| -I | :::: III:: 

Db 517 PVEPD-PMLSFAPAAALQSRPEPSSGDGASVPEPPRSQQQKPEEEAVSSSQSPTATEIPG 575 



Qy 971 LSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALAL 1018

: | : I I I I I I M I : I I : I I I : I II I I I I I : I I I I I : I M 
Db 576 PLGSGLMPPLPFFNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALT^ 635 



Qy 1019 LSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELR 1078 

|| 111 I : I I : I I : I I I I I I : I I I I I: :|:l :lll:: :IIII:IIM 

Db 636 LSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELR 695 

Qy 1079 RLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLG 1138

Mill I I I I I I I I I I I I I: lllllllllllll::|::|:|::ll:l :|| |:| III 
Db 696 RLFLVQDLVDSLKFAVI^LLTWGALFNGLTLL 755 

Qy 1139 LANKSVKDAMAKIQAKIPGLKRKAD 1163 

I : : I I I I II I I I I I I : 

Db 756 LVRTHINTVVAKIQAKIPGAKRHAE 780



Search completed: September 3, 2004, 16:08:45 
Job time : 84.0975 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: September 3, 2004, 15:58:54 ; Search time 15.4629 Seconds
(without alignments)
3916.307 Million cell updates/sec

Title: US-09-830-972-2
Perfect score: 5848
Sequence: 1 MEDIDQSSLVSSSTDSPPRP..........VKDAMAKIQAKIPGLKRKAD 1163

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 141681 seqs, 52070155 residues

Total number of hits satisfying chosen parameters: 141681

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : SwissProt_42:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                   %
Result         Query
   No.   Score Match Length DB ID          Description

     1    5848 100.0   1163  1 RTN4_RAT    Q9jk11 rattus norv
     2  4403.5  75.3   1192  1 RTN4_HUMAN  Q9nqc3 homo sapien
     3     917  15.7    199  1 RTN4_MOUSE  Q99p72 mus musculu
     4     801  13.7    777  1 RTN1_RAT    Q64548 rattus norv
     5   789.5  13.5    776  1 RTN1_HUMAN  Q16799 homo sapien
     6   625.5  10.7    236  1 RTN3_HUMAN  O95197 homo sapien
     7   625.5  10.7    237  1 RTN3_MOUSE  Q9es97 mus musculu
     8     509   8.7    471  1 RTN2_MOUSE  O70622 mus musculu
     9     490   8.4    545  1 RTN2_HUMAN  O75298 homo sapien
    10   308.5   5.3   5147  1 PCLO_HUMAN  Q9y6v0 homo sapien
    11     300   5.1   5120  1 PCLO_CHICK  Q9pu36 gallus gall
    12   295.5   5.1    865  1 CPN_DROME   Q02910 drosophila
    13   291.5   5.0   2459  1 MAPB_RAT    P15205 rattus norv
    14     288   4.9   2468  1 MAPB_HUMAN  P46821 homo sapien
    15     284   4.9   2464  1 MAPB_MOUSE  P14873 mus musculu
    16   282.5   4.8   2805  1 MAPA_HUMAN  P78559 homo sapien
    17     279   4.8   5038  1 PCLO_MOUSE  Q9qyx7 mus musculu
    18     277   4.7   3644  1 MINT_MOUSE  Q62504 mus musculu
    19   275.5   4.7   3924  1 ANK2_HUMAN  Q01484 homo sapien
    20     275   4.7   1972  1 P531_HUMAN  Q12888 homo sapien
    21   272.5   4.7   3664  1 MINT_HUMAN  Q96t58 homo sapien
    22     272   4.7   5085  1 PCLO_RAT    Q9jks6 rattus norv
    23     267   4.6   1781  1 AK12_HUMAN  Q02952 homo sapien
    24   265.5   4.5   1828  1 MAP2_MOUSE  P20357 mus musculu
    25   261.5   4.5   1087  1 NFH_MOUSE   P19246 mus musculu
    26   259.5   4.4   1026  1 NFH_HUMAN   P12036 homo sapien
    27   259.5   4.4   2404  1 SON_MOUSE   Q9qx47 mus musculu
    28   258.5   4.4   1861  1 MAP2_RAT    P15146 rattus norv
    29   257.5   4.4   2715  1 MLL4_HUMAN  Q9umn6 homo sapien
    30     256   4.4   1723  1 AIM1_HUMAN  Q9y4k1 homo sapien
    31   255.5   4.4   1140  1 YM96_YEAST  Q04893 saccharomyc
    32   254.5   4.4   6632  1 UN89_CAEEL  O01761 caenorhabdi
    33     254   4.3   1616  1 P200_MYCGE  Q49429 mycoplasma
    34     254   4.3   3381  1 PGCV_BOVIN  P81282 bos taurus
    35     253   4.3   3421  1 TEGU_HSVEB  P28955 equine herp
    36   252.5   4.3   1411  1 TCOF_HUMAN  Q13428 homo sapien
    37     252   4.3   2738  1 PGCV_RAT    Q9erb4 rattus norv
    38   250.5   4.3   4377  1 ANK3_HUMAN  Q12955 homo sapien
    39   248.5   4.2   1189  1 YJH6_YEAST  P47035 saccharomyc
    40     248   4.2   1827  1 MAP2_HUMAN  P11137 homo sapien
    41   245.5   4.2   1001  1 IF2_SYNY3   P72689 synechocyst
    42     244   4.2   1338  1 ACIN_MOUSE  Q9jix8 mus musculu
    43   241.5   4.1   8545  1 ANC1_CAEEL  Q9n4m4 caenorhabdi
    44   238.5   4.1    831  1 NFH_RAT     P16884 rattus norv
    45   238.5   4.1   1341  1 ACIN_HUMAN  Q9ukv3 homo sapien
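
The "% Query Match" column in the summaries is consistent with each hit's score expressed as a percentage of the perfect score (5848 for this query, which is also the score of the query aligned against itself in Result 1). The report does not state this formula; it is inferred from the listed values. A minimal Python sketch, with a hypothetical function name, that reproduces a few rows:

```python
PERFECT_SCORE = 5848  # "Perfect score" from the search header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect score, to one decimal place.

    Assumption: this is how the "% Query Match" column is derived; the
    report itself does not document the calculation.
    """
    return round(100.0 * score / perfect_score, 1)


# (ID, score) pairs copied from the summary table.
rows = [
    ("RTN4_RAT", 5848),
    ("RTN4_HUMAN", 4403.5),
    ("RTN4_MOUSE", 917),
    ("RTN1_RAT", 801),
]
for entry_id, score in rows:
    print(entry_id, query_match_percent(score))
```

The same calculation matches the "Query Match" value printed in the statistics line of each individual result, for example a score of 878 against the perfect score of 5848 gives 15.0%.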



ALIGNMENTS 



RESULT 1 
RTN4_RAT 

ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9; Q9WUF0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut4 vesicle 20 kDa protein).
GN RTN4 OR NOGO.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTIAL SEQUENCE.
RC STRAIN=Sprague-Dawley; TISSUE=Adipocyte;
RX MEDLINE=99249816; PubMed=10231557;
RA Morris N.J., Ross S.A., Neveu J.M., Lane W.S., Lienhard G.E.;
RT "Cloning and characterization of a 22 kDa protein from rat adipocytes:
RT a new member of the reticulon family.";
RL Biochim. Biophys. Acta 1450:68-76(1999).
RN [2]
RP SEQUENCE FROM N.A. (ISOFORMS 1, 2 AND 3).
RX MEDLINE=20129258; PubMed=10667796;
RA Chen M.S., Huber A.B., Van der Haar M.E., Frank M., Schnell L.,
RA Spillmann A.A., Christ F., Schwab M.E.;
RT "Nogo-A is a myelin-associated neurite outgrowth inhibitor and an
RT antigen for monoclonal antibody IN-1.";
RL Nature 403:434-439(2000).
RN [3]
RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4).
RC STRAIN=Wistar Kyoto; TISSUE=Vascular smooth muscle;
RA Ito T., Schwartz S.M.;
RT "Cloning of a member of the reticulon gene family in rat: one of two
RT minor splice variants.";
RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP FUNCTION.
RX MEDLINE=22033691; PubMed=12037567;

RA GrandPre T., Li S., Strittmatter S.M.; 

RT "Nogo-66 receptor antagonist peptide promotes axonal regeneration."; 

RL Nature 417:547-551(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help
CC     block the regeneration of the nervous central system in adults (By
CC     similarity).
CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By
CC     similarity).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the
CC     membrane of the endoplasmic reticulum through 2 putative
CC     transmembrane domains (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC     Event=Alternative splicing; Named isoforms=4;
CC     Name=1; Synonyms=Nogo-A, NI-220-250;
CC       IsoId=Q9JK11-1; Sequence=Displayed;
CC     Name=2; Synonyms=Nogo-B, Foocen-M1;
CC       IsoId=Q9JK11-2; Sequence=VSP_005658;
CC     Name=3; Synonyms=Nogo-C, VP20;
CC       IsoId=Q9JK11-3; Sequence=VSP_005656, VSP_005657;
CC     Name=4; Synonyms=Foocen-M2;
CC       IsoId=Q9JK11-4; Sequence=VSP_005659;
CC -!- TISSUE SPECIFICITY: Isoforms 1, 2 and 3 are present in optic
CC     nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are
CC     present in dorsal root ganglion, sciatic nerve and PC12 cells
CC     after longer exposure. Isoforms 2 and 3 are detected in kidney,
CC     cartilage, skin, lung and spleen. Isoform 3 is expressed at high
CC     level in skeletal muscle. In adult animals isoform 1 is expressed
CC     mainly in the nervous system.
CC -!- SIMILARITY: Contains 1 reticulon domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF051335; AAF01564.1; -.
DR EMBL; AJ242961; CAB71027.1;
DR EMBL; AJ242962; CAB71028.1;
DR EMBL; AJ242963; CAB71029.1;
DR EMBL; AF132045; AAD31019.1;
DR EMBL; AF132046; AAD31020.1;
DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; IDA.
DR GO; GO:0005635; C:nuclear membrane; ISS.
DR GO; GO:0005515; F:protein binding; ISS.
DR GO; GO:0019987; P:negative regulation of anti-apoptosis; ISS.
DR GO; GO:0030517; P:negative regulation of axon extension; ISS.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane.
FT DOMAIN        1    989  CYTOPLASMIC (Potential).
FT TRANSMEM    990   1010  POTENTIAL.
FT DOMAIN     1011   1104  LUMENAL (Potential).
FT TRANSMEM   1105   1125  POTENTIAL.
FT DOMAIN     1126   1163  CYTOPLASMIC (Potential).
FT DOMAIN      976   1163  RETICULON.
FT DOMAIN       33     46  POLY-GLU.
FT DOMAIN       73     76  POLY-ALA.
FT DOMAIN      140    145  POLY-PRO.
FT VARSPLIC      1    964  Missing (in isoform 3).
FT                         /FTId=VSP_005656.
FT VARSPLIC    965    975  AVLSAELSKTS -> MDGQKKHWKDK (in isoform
FT                         3).
FT                         /FTId=VSP_005657.
FT VARSPLIC    173    975  Missing (in isoform 2).
FT                         /FTId=VSP_005658.
FT VARSPLIC    192    975  Missing (in isoform 4).
FT                         /FTId=VSP_005659.
FT CONFLICT   1130   1131  MISSING (IN REF. 3; AAD31020).
SQ SEQUENCE 1163 AA; 126386 MW; 8CB894B09E94F0B6 CRC64;





Query Match 100.0%; Score 5848; DB 1; Length 1163; 

Best Local Similarity 100.0%; Pred. No. 3.9e-213; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II II I I I I I I I I I I I I I I I I I I 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 



Qy 121 PAAAVLPSKLPEDDEPPARPPPPP PAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

I I I I I I II I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 



Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I 
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 



Qy 301 AI LVENTKEEVI VRSKDKEDLVCSAALHS PQES PVGKEDRWS PEKTMDI FNEMQMS WA 360 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I 
Db 301 AI LVENTKEEVI VRSKDKEDLVCSAALHS PQES PVGKEDRWS PEKTMDI FN EMQMSWA 360 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I II 
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Qy 421 NEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 NEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ) I I I I I I I I I I I I 
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

M I I I I I I II I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Qy 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

M II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I M I I I I I I I I I II I I I I I I I 
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 



Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163



RESULT 2 
RTN4_HUMAN 

ID RTN4_HUMAN STANDARD; PRT; 1192 AA. 

AC Q9NQC3; O94962; Q9BXG5; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; Q9Y2Y7;

AC Q9Y5U6;

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 
DE (Neuroendocrine-specific protein) (NSP) (Neuroendocrine specific 

DE protein C homolog) (RTN-x) (Reticulon 5) (My043 protein).

GN RTN4 OR NOGO OR ASY OR KIAA0886.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RX MEDLINE=20129242; PubMed=10667780;

RA Prinjha R., Moore S.E., Vinson M., Blake S., Morrow R., Christie G.,

RA Michalovich D., Simmons D.L., Walsh F.S.; 

RT "Inhibitor of neurite outgrowth in humans."; 

RL Nature 403:383-384(2000). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2).

RC TISSUE=Brain;

RX MEDLINE=21010696; PubMed=11126360;

RA Tagami S., Eguchi Y., Kinoshita M., Takeda M., Tsujimoto Y.;

RT "A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on 

RT endoplasmic reticulum and reduces their anti-apoptotic activity."; 

RL Oncogene 19:5736-5746(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3).

RX MEDLINE=20237542; PubMed=10773680;

RA Yang J., Yu L., Bi A.D., Zhao S.-Y.;

RT "Assignment of the human reticulon 4 gene (RTN4) to chromosome

RT 2p14-->2p13 by radiation hybrid mapping.";

RL Cytogenet. Cell Genet. 88:101-102(2000).

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4). 

RA Jin W.-L., Ju G.;

RT "Developmentally-regulated alternative splicing in a novel Nogo-A.";

RL Submitted (NOV-2000) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3).

RC TISSUE=Placenta, and Skeletal muscle;

RA Ito T., Schwartz S.M.;

RT "Cloning of a member of the reticulon gene family in human.";

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases.

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Fibroblast; 

RA Yutsudo M.;

RT "Isolation of a cell death-inducing gene.";

RL Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases.

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3).

RC TISSUE=Pituitary;

RA Song H., Peng Y., Zhou J., Huang Q., Dai M., Mao Y.M., Yu Y., Xu X.,

RA Luo B., Hu R., Chen J.;

RT "Human neuroendocrine-specific protein C (NSP) homolog gene.";

RL Submitted (JUL-1998) to the EMBL/GenBank/DDBJ databases.

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3).

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P.,

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y.,

RA Yu J., Han L.H.;

RT "Novel human cDNA clone with function of inhibiting cancer cell

RT growth.";

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases.

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RC TISSUE=Brain;

RX MEDLINE=99156230; PubMed=10048485;

RA Nagase T., Ishikawa K.-I., Suyama M., Kikuno R., Hirosawa M.,

RA Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. XII.

RT The complete sequences of 100 new cDNA clones from brain which code

RT for large proteins in vitro.";

RL DNA Res. 5:355-364(1998).

RN [10] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3).

RC TISSUE=Brain, Ovary, Pancreas, Placenta, and Skeletal muscle;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length

RT human and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [11] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RX MEDLINE=20499367; PubMed=11042152;

RA Zhang Q.-H., Ye M., Wu X.-Y., Ren S.-X., Zhao M., Zhao C.-J., Fu G.,

RA Shen Y., Fan H.-Y., Lu G., Zhong M., Xu X.-R., Han Z.-G., Zhang J.-W.,

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J., Chen S.-J., Chen Z.;

RT "Cloning and functional analysis of cDNAs with open reading frames for

RT 300 previously undefined genes expressed in CD34+ hematopoietic

RT stem/progenitor cells."; 

RL Genome Res. 10:1546-1560(2000). 

RN [12] 

RP SEQUENCE OF 482-1192 FROM N.A. (ISOFORM 1/4). 

RC TISSUE=Brain;

RA Mao Y.M., Xie Y., Zheng Z.H.;

RL Submitted (MAY-1998) to the EMBL/GenBank/DDBJ databases.

RN [13]

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1).

RC TISSUE=Testis;

RA Sha J.H., Zhou Z.M., Li J.M.;

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases.

RN [14] 

RP TOPOLOGY. 

RC TISSUE=Brain; 

RX MEDLINE=20129259; PubMed=10667797;

RA GrandPre T., Nakamura F., Vartanian T., Strittmatter S.M.; 

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 403:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain;

RX MEDLINE=21069055; PubMed=11201742;

RA Fournier A.E., Grandpre T., Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 

RT regeneration."; 

RL Nature 409:341-346(2001). 

RN [16] 

RP REVIEW. 

RX MEDLINE=21888956; PubMed=11891768;

RA Ng C.E.L., Tang B.L.; 

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron 

RT regeneration."; 

RL J. Neurosci. Res. 67:559-565(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults. 

CC Isoform 2 reduces the anti-apoptotic activity of Bcl-xl and Bcl-2.

CC This is likely consecutive to their change in subcellular 

CC location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

CC through 2 putative transmembrane domains.

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=4;

CC Name=1; Synonyms=RTN 4A, Nogo-A, RTN-xL;

CC IsoId=Q9NQC3-1; Sequence=Displayed;

CC Name=2; Synonyms=RTN 4B, Nogo-B, RTN-xS, Foocen-M;

CC IsoId=Q9NQC3-2; Sequence=VSP_005655;

CC Name=3; Synonyms=RTN 4C, Nogo-C, Foocen-S;

CC IsoId=Q9NQC3-3; Sequence=VSP_005652, VSP_005653;

CC Name=4;

CC IsoId=Q9NQC3-4; Sequence=VSP_005654;



CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain
CC and testis and weakly in heart and skeletal muscle. Isoform 2 is
CC widely expressed excepted for the liver. Isoform 3 is expressed in
CC brain, skeletal muscle and adipocytes. Isoform 4 is testis-specific.
CC -!- SIMILARITY: Contains 1 reticulon domain.
CC -!- CAUTION: Ref.11 sequence differs from that shown due to
CC frameshifts in positions 1149 and 1156.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AJ251383; CAB99248.1;
DR EMBL; AJ251384; CAB99249.1;
DR EMBL; AJ251385; CAB99250.1;
DR EMBL; AB040462; BAB18927.1;
DR EMBL; AB040463; BAB18928;
DR EMBL; AF148537; AAG12176;
DR EMBL; AF148538; AAG12177;
DR EMBL; AF087901; AAG12205;
DR EMBL; AF320999; AAG40878;
DR EMBL; AF132047; AAD31021;
DR EMBL; AF132048; AAD31022;
DR EMBL; AB015639; BAA83712;
DR EMBL; AF077050; AAD27783;
DR EMBL; AF177332; AAG17976;
DR EMBL; AB020693; BAA74909;
DR EMBL; BC001035; AAH01035;
DR EMBL; BC007109; AAH07109;
DR EMBL; BC014366; AAH14366;




Query Match 75.3%; Score 4403.5; DB 1; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 9.1e-159; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

IIMII I I I I I I I I I I I I I I I I I I I I I I | | | I I I I I : I I :: I I II I I I I I I I I | 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I M I I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I I | | | : M : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

I M I I I I I I I I I I II I I I I I I I I I I I I I I M II I I I I I I I I II I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I M I I I I I II I I II I I I I I It I I III II I 111:1:1 I I III I I I I I I I M I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVXLETAASLPS 237 



Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285



I I I • • I '■Mill- I • I ••III I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I III 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

Mill I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I | | : | | | : | : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

11111:11 Mill: III! Ml I I I I II I Ml I II I I II I Mill 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I M I I I I I I I I I I I I I I M I II M I I II II II M II I II I I II II II I: 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

Ml I I : I M II I I I I I I I II I II I I I I I I I I I II I I I M I M II I M I I I I I Ml 
Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I I M M I I I I I I I I II I I II I II I : I I I I I I I : I M Mill I : I : I I I II I I 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

II I I I I I I M M I : I II Mill: II I M I I I I II I II II I I I I I II || I I : I I I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENIN7UVLQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

1-111 = 11 I : I I M M II I II II M I I II I I I I II II M II Ml I II M I I I II I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : M M I I I I I I I II I II : : I : I I I : : : I I : I I I I I IMM 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: M M I I I I I III : I: I M I II II I I I I I II I II I MM II I II M M I M I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I I I I I I I I I I I I I II I II : : : M M I M I : I I ■ : I M 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

Ml I I : I I I I I I I I II I I II I II I I II I I II: I I I II I I I I I I I II II I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I M II II I I II II I II I I I I || I I I I || || I I | M || | | | | M | || | | M I I II I I II I I 
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

M I II II I I II I I II I II I I II II I I II I II I II II I II M I I I I I I I I I I II II I I I I 



Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135



Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I.: I I I I I I I I I I I I I I I I I M : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 
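
Note on the summary statistics: for this hit, the reported "Best Local Similarity" is consistent with the number of identical positions (Matches) divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). The Python sketch below illustrates that reading. The function name is hypothetical, and the formula is an inference from the reported numbers, not a documented definition taken from the search program.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are identical.

    Assumption: the denominator is every aligned column, i.e. the sum of
    matches, conservative substitutions, mismatches and indel positions.
    """
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts as reported for RESULT 2 (RTN4_HUMAN):
# Matches 909; Conservative 104; Mismatches 145; Indels 39
similarity = best_local_similarity(909, 104, 145, 39)
print(f"{similarity:.1f}%")  # compare with the reported 75.9%
```

The same calculation can be applied to the counts listed for the other hits in this report to check them against their reported percentages.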



RESULT 3 
RTN4_MOUSE 

ID RTN4_MOUSE STANDARD; PRT; 199 AA. 

AC Q99P72; Q9CTE3;

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein).

GN RTN4 OR NOGO.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=3T3-L1; TISSUE=Adipocyte; 

RA Coulson A.C., Craggs P.D., Morris N.J.; 

RT "Mouse vp20/RTN4C cDNA.";

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Embryo;

RX MEDLINE=21085660; PubMed=11217851;

RA Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y.,

RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S.,

RA Aizawa K., Izawa M., Nishi K., Kiyosawa H., Kondo S., Yamanaka I.,

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R.,

RA Kadota K., Matsuda H.A., Ashburner M., Batalov S., Casavant T.,

RA Fleischmann W., Gaasterland T., Gissi C., King B., Kochiwa H.,

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J.,

RA Schriml L.M., Staubli F., Suzuki R., Tomita M., Wagner L., Washio T.,

RA Sakai K., Okido T., Furuno M., Aono H., Baldarelli R., Barsh G.,

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F.,

RA Brownstein M.J., Bult C., Fletcher C., Fujita M., Gariboldi M.,

RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M., Lee N.H.,

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P.,

RA Nordone P., Ring B., Ringwald M., Rodriguez I., Sakamoto N.,

RA Sasaki H., Sato K., Schoenbach C., Seya T., Shibata Y., Storch K.-F.,

RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C., Whittaker C., Wilming L.,

RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S.,

RA Hayashizaki Y.;

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity).

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 
CC similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 
CC transmembrane domains (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=1;
CC Comment=A number of isoforms may be produced;
CC Name=1;
CC IsoId=Q99P72-1; Sequence=Displayed;
CC -!- SIMILARITY: Contains 1 reticulon domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF326337; AAK08076.1;
DR EMBL; AK003859; NOT_ANNOTATED_CDS.
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; ISS.
DR GO; GO:0005635; C:nuclear membrane; ISS.
DR GO; GO:0005515; F:protein binding; ISS.
DR GO; GO:0019987; P:negative regulation of anti-apoptosis; ISS.
DR GO; GO:0030517; P:negative regulation of axon extension; ISS.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane.
FT DOMAIN 1 25 CYTOPLASMIC (Potential).
FT TRANSMEM 26 50 POTENTIAL.
FT DOMAIN 51 137 LUMENAL (Potential).
FT TRANSMEM 138 162 POTENTIAL.
FT DOMAIN 163 199 CYTOPLASMIC (Potential).
FT DOMAIN 12 199 RETICULON.
SQ SEQUENCE 199 AA; 22466 MW; 07BE5D580059ED9C CRC64;



Query Match 15.7%; Score 917; DB 1; Length 199;

Best Local Similarity 97.9%; Pred. No. 1.5e-28;

Matches 187; Conservative 1; Mismatches 3; Indels 0; Gaps 0;



Qy  973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032
        |  |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    9 KDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 68



Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092

I I M I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I 
Db 69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 128 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152

I M I I M I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 129 AVLMWVFTWGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGL 188 

Qy 1153 AKIPGLKRKAD 1163

II I I I I I I I I : 

Db 189 AKIPGLKRKAE 199



RESULT 4 
RTN1_RAT



ID RTN1_RAT STANDARD; PRT; 777 AA. 

AC Q64548; Q64547;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specific protein) (S-rex).

GN RTN1 OR NSP.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-B AND RTN1-S).

RC STRAIN=Wistar; TISSUE=Brain cortex;

RX MEDLINE=96386034; PubMed=8793864;

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmentalization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane (By 

CC similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2;

CC Name=RTN1-B; Synonyms=S-RexB;

CC IsoId=Q64548-1; Sequence=Displayed;

CC Name=RTN1-S; Synonyms=S-RexS;

CC IsoId=Q64548-2; Sequence=VSP_005647, VSP_005648;

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN. 

CC EXPRESSION OF ISOFORM RTN1-B IS RESTRICTED TO PARTICULAR NEURONAL

CC TYPES.

CC -!- DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY E10 IN THE

CC HINDBRAIN AND IN E11 IN THE FOREBRAIN. DURING THE NEXT 3 EMBRYONIC

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT E13 IN 

CC THE HINDBRAIN AND AT E14 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC : 

DR EMBL; U17604; AAC53046.1; -. 

DR EMBL; U17603; AAC53045.1; -. 



DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 604 624 POTENTIAL.

FT TRANSMEM 727 747 POTENTIAL.

FT DOMAIN 590 777 RETICULON.

FT DOMAIN 610 613 POLY-LEU.

FT VARSPLIC 1 569 Missing (in isoform RTN1-S).

FT /FTId=VSP_005647.

FT VARSPLIC 570 589 IPGPLGSDLVPPLPFFNKQK -> MQATADSTKMDCVWSNW

FT KSQ (in isoform RTN1-S).

FT /FTId=VSP_005648.

SQ SEQUENCE 777 AA; 83001 MW; AF7479C50F28D0AC CRC64; 

Query Match 13.7%; Score 801; DB 1; Length 777; 

Best Local Similarity 32.4%; Pred. No. 2e-23; 

Matches 258; Conservative 109; Mismatches 264; Indels 166; Gaps 32; 

Qy 485 EKT S P KT SN P FLVAVQDS E ADYVTTDTL- SKVTEAAVSNMPEGL — TPD 530 

I : : I I : I I I I I I I : : I : : I : I : II 

Db 29 EEATPKGARP AQQ D GE P AWG S GAGAGWS S RG LC S G PARS P P VAMETAS T GVAAVP D 8 5 

Qy 531 LVQEACESELNEATG T KI AYET KVDLVQT S EAI Q- E S L Y PTAQLC P S FEEAEA 582 

: : I : I : | | : | : : I I : : I - I I 
Db 86 ALDHSSSPTLKDGEGACYTSLISDICYPPREDSAYFTGILQKENGHITTSESP EELG 142 

Qy 583 TPSPVLPDIVMEAPLNSLLPSAGASWQPS VSPLEAPPPVSYDSIKLE 630 

Mill:: 1:111 : I : I : : I I I : I I 

Db 143 TPGPSLPEVPGTEP-HGLLSSDSGIEMTPAESTEVNKILADP L DQMKAEAC K Y I D I T 198 

Qy 631 PENPPPYEEAMNVALK ALGTK-EGIKEPE SFNAAVQETE 668 

I | | : : | : I I I I : I I : I 

Db 199 RPQEAKGQEEQSPGLEDKDLDFKDKDSEVSTKPEGVHAPNQPSPVEGKLIKDNLFEESTF 258 

Qy 669 APYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKSVPEHAELVEDSSPESEPVD 723 

I I I I I : : : : I : I : I I : : : I I : I I I : : I 

Db 259 APYIDELSD — EQHRMSLVTAPVKITLTEIGPPVMTATHETIPEKQDLCLKPSPDTVPTV 316 

Qy 724 LFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK EERLSASPQELGKPYLESF 778 

I : I : : I I I I : I I : I : : I : I I .: 

Db 317 TVSE PEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIKEAKGLSYETTE 370 

Qy 779 QPN LHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETF 833 

I I I : : | | : I i I I I II 

Db 371 SPRPVGQAADRPKVKARSGLPTIPS SLDQEASSAESGDSEIELV 414 

Qy 834 SDSSPIEIIDEFPT FVSAKDDSPKLAK-EYTDLEVSDKSEIANIQSGADSLPCL 886 

I : I : II: II II ■ : I : I : : I : II : 

Db 415 SE- DPMAS EDALP S GYVS FGHVS GP P P S PAS PS I Q YS I LREEREAEL DSELII 466 

Qy 887 ELPCDL-SFKNIYPKDEVH VSDEFSENRSSVSKASISPSNVSALEPQTEM 935 

I I I I III II III: : I I : : I I : 

Db 467 E-SCDASSASEESPKREQDSPPMKPGVLDAI REETSSRATEERAPSHQGPVEPDPILSFT 525 

Qy 936 GSIV KSKSLTKEAE KKLPSDTEKEDRSLSAVLSAEL-- 971 

I : I II: II : I : I I : I : : I 



Db 526 PVTLQSRPEPSSGDGAPVPEPPKSQQQKPEEEAVSSSQSPAATE IPGPLGSDLVP 580



Qy 972 S KT S WDLLYWRDI KKTGWFGAS LFLLLS LTVFS I VSVTAYI ALALLS VT I S FR 102 6 

: I : I I I I I I I I I : I I : I I I : I II III I I : I M I I : I I I II I I I I I 
Db 581 PLPFFNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFR 640 

Qy 1027 IYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDL 1086 

III I : I I : I I : I I I I I I : I II I I : : I : I : I I I :: : I I I I : I I I I I I I I I II 

Db 641 IYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDL 700

Qy 1087 VDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKD 1146 

I I I II I I I I I I : I I M I I I I II I I I : : I : : I : I : : I I : I : I I I : I I I I I : 
Db 701 VDSLKFAVLMWLLTYVGALFNGLTLLLMAWSMFTLPWYVKHQAQVDQYLGLVRTHINT 760 



Qy 1147 AMAKIQAKI PGLKRKAD 1163 

= II I I I I I I I Ml: 
Db 761 WAK I QAK I P GAKRHAE 777 



RESULT 5 
RTN1_HUMAN 

ID RTN1_HUMAN STANDARD; PRT; 776 AA.

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specific protein).

GN RTN1 OR NSP. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-A; RTN1-B AND RTN1-C).

RC TISSUE=Lung carcinoma;

RX MEDLINE=93293865; PubMed=7685762;

RA Roebroek A.J.M., Van de Velde H.J.K., Van Bokhoven A., Broers J.L.V., 

RA Ramaekers F.C.S., Van de Ven W.J.M.; 

RT "Cloning and expression of alternative transcripts of a novel 

RT neuroendocrine-specific gene and identification of its 135-kDa

RT translational product."; 

RL J. Biol. Chem. 268:13439-13447(1993). 

RN [2] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE=96429995; PubMed=8833145;

RA Roebroek A.J.M., Ayoubi T.A.Y., Van de Velde H.J.K., 

RA Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.J.M.; 

RT "Genomic organization of the human NSP gene, prototype of a novel gene 

RT family encoding reticulons.";

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=9560466;

RA Hens J., Nuydens R., Geerts H., Senden N.H., Van de Ven W.J.M.,

RA Roebroek A.J., van de Velde H.J.K., Ramaekers F.C., Broers J.L.; 

RT "Neuronal differentiation is accompanied by NSP-C expression."; 

RL Cell Tissue Res. 292:229-237(1998). 



CC -!- FUNCTION: May be involved in neuroendocrine secretion or in
CC membrane trafficking in neuroendocrine cells.
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=RTN1-A; Synonyms=NSP-A;
CC IsoId=Q16799-1; Sequence=Displayed;
CC Name=RTN1-B; Synonyms=NSP-B;
CC IsoId=Q16799-2; Sequence=VSP_005644;
CC Name=RTN1-C; Synonyms=NSP-C;
CC IsoId=Q16799-3; Sequence=VSP_005645, VSP_005646;
CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES
CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTN1-C
CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION.
CC -!- PTM: Isoforms RTN1-A and RTN1-B are phosphorylated.
CC -!- SIMILARITY: Contains 1 reticulon domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).



DR EMBL; L10333; AAA59950.1
DR EMBL; L10334; AAA59951.1
DR EMBL; L10335; AAA59952.1
DR PIR; A46583; A46583.
DR PIR; 160904; 160904.
DR Genew; HGNC:10467; RTN1.
DR MIM; 600865;
DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; TAS.
DR GO; GO:0004871; F:signal transducer activity; NAS.
DR GO; GO:0030182; P:neuron differentiation; TAS.
DR GO; GO:0007165; P:signal transduction; NAS.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.


KW Endoplasmic reticulum; Alternative splicing; Transmembrane;
KW Phosphorylation.






FT TRANSMEM 603 623 POTENTIAL.
FT TRANSMEM 726 746 POTENTIAL.
FT DOMAIN 589 776 RETICULON.
FT DOMAIN 609 612 POLY-LEU.
FT VARSPLIC 1 420 Missing (in isoform RTN1-B).
FT /FTId=VSP_005644.
FT VARSPLIC 1 568 Missing (in isoform RTN1-C).
FT /FTId=VSP_005645.
FT VARSPLIC 569 588 GPGPLGPGAPPPLLFLNKQK -> MQATADSTKMDCVWSNW
FT KSQ (in isoform RTN1-C).
FT /FTId=VSP_005646.


SQ SEQUENCE 776 AA; 83617 MW; CA5B6232353096FE CRC64;



Query Match 13.5%; Score 789.5; DB 1; Length 776;
Best Local Similarity 31.2%; Pred. No. 5.3e-23;
Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28;



Qy 487 TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGT 546

: I : I I I : : : : I : I II : I : I : 
Db 65 SGPARQSP — VAMETASTGVAGVS S AMDHT FSTTSKDGEG SCYTSLI S 110 

Qy 547 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI — VMEAPLNS 599 

11:1 : : I I : : I I I MINI:: II 
Db 111 DICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGIESRGLFSSDSG 167 

Qy 600 — LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 

: I : I : I I : : I : : I : : : I I : 

Db 168 IEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 227 

Qy 642 NVALKALGTKEGI KEPE SFNAAVQETEAPYI SIACDLIKETKLSTE-PSP 690 

: : : : I I I : : I I : : : I I I I I I I : I : : : : I 

Db 228 DISIK PEGVREPDKPAPVEGKI I KDHLLEESTFAPYI D DLSEEQRRAPQITTP 280

Qy 691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSIPEV P QT Q E EAVMLMK 745 

: I I I :: : | | : : | | | :: | | | : : 

Db 281 VKITLTEIE PSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP 334 

Qy 746 ESLTEVSETVAQHK J — EERLSAS PQELGKP YLES FQP NLHSTKDAASND 792 

I II I : I I : | : : | : | | : | | : : 

Db 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKARSGPPT 394 

Qy 793 IPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKI 827 

II: II:: : | | : : | | : 

Db 395 IPSPLDHEASSAESGDSEIELVSEDPM — AAEDALPSGYVS FGHVGGPPPSPASPSIQYS 452 

Qy 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSG 87 9 

: I I II I : I I I I : : I I I : : : : I I : 

Db 453 ILREEREAELDS ELIIESCDASSASEESPKREQDSPPMKPSALDAI REETGVRAEER 509 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

II II II : II I I I Nihil 
Db 510 APSRRGLAEPG — SFLD-YPSTEPQPGPEL PPGDGALEPETPM 54 9 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRD 984 

II I : : I : : : I : I : I I I I I I I 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 I KKTGVVFGAS LFLLLSLTVFS IVS VTAYI ALALLSVT I S FRI YKGVI QAI QKS DEGHP F 1044 

11:11:111: I II III I I : I I I I I : I I I II I I I I I I I I I : I I : I I : I I I I II 
Db 598 I KQTGIVFGS FLLLLFSLTQFSWSVVAYLALAALSATI S FRI YKSVLQAVQKTDEGHPF 657 

Qy 1045 RAY LE S EVAI S EE LVQ K Y SN S AL GHVN S T I KEL RRL FLVD D LVD S L K FAVLMWVFT YVGA 1104 

: I I I I I : : I : I : I I I : : : I I I I : I I I I I I I I I I I M I I I I I I I I I : I I I I I 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLI LALI S LFS I PVI YERHQVQI DHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

I I I I I I M : : I : : I : I : : I I : I : II I I I II I I : : I I I I I I I I I I I I : 

Db 718 L FN GLT L L LMAWSM FT L P WYVKHQAQ I DQ YL GLVRT H I MAWAK I QAK I P GAKRHAE 776 

RESULT 6 
RTN3_HUMAN 

ID RTN3_HUMAN STANDARD; PRT; 236 AA.



AC O95197;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 3 (Neuroendocrine-specific protein-like 2) (NSP-like
DE protein II) (NSPLII).

GN RTN3 OR NSPL2. 

OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=Retina; 

RX MEDLINE=99265974; PubMed=10331947;

RA Moreira E.F., Jaworski C.J., Rodriguez I.R.; 

RT "Cloning of a novel member of the reticulon gene family (RTN3): gene
RT structure and chromosomal localization to 11q13.";

RL Genomics 58:73-81(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Du G., Yuan J., Qiang B.;
RT "Cloning and expression analysis of a cDNA encoding a novel
RT neuroendocrine-specific protein-like protein 1: NSPL1.";
RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain, Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential).

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 
CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF059524; AAC99319.1; -.
DR EMBL; AF059529; AAD20951.1; -.
DR EMBL; AF059525; AAD20951.1; JOINED.
DR EMBL; AF059526; AAD20951.1; JOINED.
DR EMBL; AF059527; AAD20951.1; JOINED.
DR EMBL; AF059528; AAD20951.1; JOINED.
DR EMBL; AF119297; AAD26810.1; -.
DR EMBL; BC000634; AAH00634.1; -.
DR EMBL; BC010556; AAH10556.1; -.
DR EMBL; BC011394; AAH11394.1; -.
DR EMBL; BC022993; AAH22993.1; -.
DR Genew; HGNC:10469; RTN3.
DR MIM; 604249; -.
DR GO; GO:0005615; C:extracellular space; TAS.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 68 88 POTENTIAL.
FT TRANSMEM 177 197 POTENTIAL.
FT DOMAIN 48 236 RETICULON.
SQ SEQUENCE 236 AA; 25609 MW; DDC6A4544ABCDFB7 CRC64;



Query Match 10.7%; Score 625.5; DB 1; Length 236;
Best Local Similarity 59.1%; Pred. No. 1.7e-17;
Matches 114; Conservative 41; Mismatches 37; Indels 1; Gaps 1;



Qy 972 SKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 1031
. I :l ll::IM:|lll III :| :llll :|: I I I I I I I I I I I I I I I
Db 44 SSCAVHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSVISVVSYLILALLSVTISFRIYKSV 103

Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091
M I : I I I : I I I I I : I I I : : : : I I I I : I : I : I : I : I I I I I : I I I I I II
Db 104 IQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLK 163

Qy 1092 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKI 1151
M I I : I I I I I : I II : I I I II I : : I I : I : : I I : : : I I I II : I : I I : I I
Db 164 LAVFMWLMTYVGAVFNGITLLILAELLIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKI 223

Qy 1152 QAKIPGL-KRKAD 1163
I I I : I I : |:||:
Db 224 QAKLPGIAKKKAE 236
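
The summary figures printed above each alignment appear to be related by simple arithmetic. The sketch below assumes (it is not stated anywhere in this listing) that "Best Local Similarity" is the number of identical matches divided by the total number of alignment columns, taken as Matches + Conservative + Mismatches + Indels. The function name is hypothetical. Under that assumption the counts reported for RESULT 5 and RESULT 6 reproduce the printed percentages.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are identical matches.

    Assumption: the column total is the sum of the four counts
    printed in the score header of each result.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from the RESULT 6 score header (RTN3_HUMAN).
result_6 = best_local_similarity(114, 41, 37, 1)
# Counts copied from the RESULT 5 score header.
result_5 = best_local_similarity(243, 115, 252, 169)

print(round(result_6, 1), round(result_5, 1))
```

The same check can be repeated for any other result in the listing by substituting its four counts.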



RESULT 7 
RTN3_MOUSE 

ID RTN3_MOUSE STANDARD; PRT; 237 AA. 

AC Q9ES97; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 



DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Qiang H., Yuan J., Qiang B.; 

RT "Cloning and expression profile of a novel mouse cDNA encoding a human 

RT RTN3 homolog.";
RL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=Eye;
RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic
CC reticulum (Potential).

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF195940; AAG31360.1; -. 

DR EMBL; BC014697; AAH14697.1; -.
DR MGD; MGI:1339970; Rtn3.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 89 POTENTIAL.
FT TRANSMEM 167 187 POTENTIAL.
FT DOMAIN 49 237 RETICULON.
SQ SEQUENCE 237 AA; 25428 MW; EB60A0A7AC45F0DE CRC64;



Query Match 10.7%; Score 625.5; DB 1; Length 237;
Best Local Similarity 59.1%; Pred. No. 1.7e-17;
Matches 114; Conservative 41; Mismatches 37; Indels 1; Gaps 1;



Qy 972 SKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 1031
I : I I I :: I I I : I I I I III : I : I I I I I I : : I I : | : I I I I I I I I I I I : I I I
Db 45 SSCAVHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSVTSVVSYLILALLSVTISFRVYKSV 104

Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091
I I I : I I I : I I I I I : I I I : : : : I I I I : I : I I I : I : I I I I I : I I I I I I I
Db 105 IQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHVNKALKLIIRLFLVEDLVDSLK 164

Qy 1092 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKI 1151
I I I I : I I I I I : I I I : I I I II I : : I I : I : : I I : : : II I I I : I : I I : I I
Db 165 LAVFMWLMTYVGAVFNGITLLILAELLVFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKI 224

Qy 1152 QAKIPGL-KRKAD 1163
llhll: I : I I :
Db 225 QAKLPGIAKKKAE 237



RESULT 8 
RTN2_MOUSE

ID RTN2_MOUSE STANDARD; PRT; 471 AA.

AC O70622; O70620;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specific protein-like 1) (NSP-like
DE protein 1) (NSPLI).
GN RTN2 OR NSPLI.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY. 

RC STRAIN=FVB/N, and 129/Sv; TISSUE=Cerebellum, and Skeletal muscle; 

RX MEDLINE=98191726; PubMed=9530622;
RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.;

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998).

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TISSUE=Retina; 

RX MEDLINE=22388257; PubMed=12477932;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Membrane-bound. Endoplasmic reticulum 

CC (Potential).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1; Synonyms=Brain;
CC IsoId=O70622-1; Sequence=Displayed;
CC Name=2; Synonyms=Muscle;
CC IsoId=O70622-2; Sequence=VSP_005650, VSP_005651;

CC -!- TISSUE SPECIFICITY: Expressed predominantly in neural and muscular 
CC tissues. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF038537; AAC14906.1; -. 

DR EMBL; AF038537; AAC14907.1; -. 

DR EMBL; AF038538; AAC14908.1; -. 

DR EMBL; AF038539; AAC14909.1; -. 

DR EMBL; AF093624; AAD13195.1; -.
DR EMBL; BC031370; AAH31370.1; -.
DR MGD; MGI:107612; Rtn2.
DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 295 315 POTENTIAL.
FT DOMAIN 272 471 RETICULON.
FT VARSPLIC 1 267 Missing (in isoform 2).
FT /FTId=VSP_005650.
FT VARSPLIC 268 271 PLLL -> MGSK (in isoform 2).
FT /FTId=VSP_005651.
SQ SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64;



Query Match 8.7%; Score 509; DB 1; Length 471; 

Best Local Similarity 28.6%; Pred. No. 1e-12;

Matches 146; Conservative 84; Mismatches 175; Indels 106; Gaps 13; 



Qy 690 PDFSNYSEIAKFEKSVPEHAELVEDSSP — ESEPVDLFSDDSIPEVPQTQEEAVMLMKES 747 

I I : : I I I : I II I I I : I 
Db 6 PVFAHCKEAPSTASSTPDSTEGGNDDSDFRELHTAREFSED 46 

Qy 748 LTEVSETVAQH KEERL'SASPQELGKPYLESFQPNLHST 785 

I I I : I : : : I : I : I I I 

Db 47 — EEEETTSQDWGTPRELTFSYIAFDGWGSGGRRDSWRRPRPQGRSVSEPRDPPQQSG 104 

Qy 786 KDAASNDI PTLTKK EKISLQMEEFNTAIYSNDDLLSSKEDK 826 

: I I : I : : I : : I : : : : : I III 
Db 105 LGDSLESIPSLSQSPEPGRRGDPDPVPPAERPLEELRLRLDQLGWWRS AGSGED- 159 

Qy 827 IKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886 

I I : I : I : I I : I : I I I : I I : : I I 
Db 160 SATSSSTPLE — NEEPDGLEASE AGEETNLEL RLAQ SL 195 

Qy 887 ELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTK 946

I :: : I : I I I I : I : I I ::: : :| 

Db 196 HLQLEVLTPQLSPSSGTPQAHTPSPQRSQDSNS — GPDDEPLLNWEEHWRLLEQEPITA 253 

Qy 947 EAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFS 1006

: I : : : I : I . I I I I I I : I : : I I I : III II 

Db 254 QC -LDSTDQSEFMLEPLL -LVADLLYWKDTRTSGAVFTGLMASLLCLLHFS 302 

Qy 1007 IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 1066

I I I I I I I I I I I I : I : I : II : : I : I I : I II : : : : : I : : I 
Db 303 IVSVAAHLALLGLCATISLRWRKVLQAVHRGDGTNPFQAYLDMDLTLTREQTERLSQQI 362 

Qy 1067 LGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIY 1126

II II : I I I I I : I I I I I I I I : I : : I : I I I : I I I I I I : I I : : : I I : : I : : I 

Db 363 AS HWS TAT Q L RH F FLVE D LVD S L KLAL L FY I LT FVGAI FN GLT LVI LGWAL FT VP L L Y 422 

Qy 1127 ERHQVQIDHYLGLANKSVKDAMAKIQAKIPG 1157 

: I I I I I I : I I : I I I : I I I I I 

Db 423 RQHQAQIDQYVGLVTNQLSHIKAKIRAKIPG 453 
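
The VARSPLIC feature lines of an entry describe how each alternative isoform differs from the displayed sequence, so the isoform length can be derived from them. The sketch below is illustrative only: the function name and the tuple layout are hypothetical, and it assumes that the feature coordinates are 1-based and inclusive and that "Missing" means the range is deleted with no replacement. The values are copied from the FT lines of RTN2_MOUSE (VSP_005650 and VSP_005651, sequence length 471 AA).

```python
def isoform_length(canonical_length, varsplic):
    """Length of an isoform after applying VARSPLIC edits.

    varsplic is a list of (start, end, replacement) tuples with
    1-based inclusive coordinates; replacement is "" for "Missing".
    """
    length = canonical_length
    for start, end, replacement in varsplic:
        removed = end - start + 1          # residues replaced or deleted
        length += len(replacement) - removed
    return length


# RTN2_MOUSE isoform 2: residues 1-267 missing, 268-271 PLLL -> MGSK.
rtn2_mouse_isoform_2 = [(1, 267, ""), (268, 271, "MGSK")]
print(isoform_length(471, rtn2_mouse_isoform_2))
```

Because the second edit swaps four residues for four, only the deletion of residues 1 to 267 changes the length in this case.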



RESULT 9 
RTN2_HUMAN

ID RTN2_HUMAN STANDARD; PRT; 545 AA.

AC O75298; O60509;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specific protein-like 1) (NSP-like
DE protein 1) (NSPLI).
GN RTN2 OR NSPLI.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY.
RC TISSUE=Lung carcinoma;
RX MEDLINE=98360096; PubMed=9693037;

RA Roebroek A.J.M., Contreras B., Pauli I.G.L., Van de Ven W.J.M.; 

RT "cDNA cloning, genomic organization, and expression of the human RTN2 



RT gene, a member of a gene family encoding reticulons.";

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N.A. (ISOFORM RTN2-B).
RC TISSUE=Brain;
RX MEDLINE=98191726; PubMed=9530622;
RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.;

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2;
CC Name=RTN2-A;
CC IsoId=O75298-1; Sequence=Displayed;
CC Note=Isoform RTN2-C is produced by alternative initiation at
CC Met-341 of isoform RTN2-A;
CC Name=RTN2-B;
CC IsoId=O75298-2; Sequence=VSP_005649;
CC Event=Alternative initiation;
CC Comment=2 isoforms, RTN2-A (shown here) and RTN2-C, are produced
CC by alternative initiation at Met-1 and Met-341;

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL 
CC MUSCLE. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF004222; AAC32542.1; -. 

DR EMBL; AF004223; AAC32543.1; -. 

DR EMBL; AF004224; AAC32544.1; -. 

DR EMBL; AF038540; AAC14910.1; -. 

DR Genew; HGNC:10468; RTN2.
DR MIM; 603183; -.
DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; NAS.
DR GO; GO:0004871; F:signal transducer activity; NAS.
DR GO; GO:0007165; P:signal transduction; NAS.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 

KW Alternative initiation. 

FT CHAIN 1 545 RETICULON PROTEIN 2, ISOFORM RTN2-A. 

FT CHAIN 341 545 RETICULON PROTEIN 2, ISOFORM RTN2-C. 

FT INIT_MET 341 341 FOR ISOFORM RTN2-C. 

FT TRANSMEM 368 388 POTENTIAL. 

FT TRANSMEM 463 483 POTENTIAL.
FT DOMAIN 345 545 RETICULON.
FT VARSPLIC 272 344 Missing (in isoform RTN2-B).
FT /FTId=VSP_005649.
SQ SEQUENCE 545 AA; 59263 MW; 971FD2F909E1E9E6 CRC64;



Query Match 8.4%; Score 490; DB 1; Length 545; 

Best Local Similarity 28.3%; Pred. No. 6.3e-12; 

Matches 160; Conservative 76; Mismatches 202; Indels 128; Gaps 17; 

Qy 668 EAPYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKSVPEHAELVEDSSPESEPVDL 724 

I I- 1 I I I I I I : : I : : I : I : : I 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEEDEEETTSQDWGTPRELTFSY '66 

Qy 725 FSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQPNLH 783 

: I : : : : III I I : I I : : I I : I : I 

Db 67 IAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESI — PSLSQSPEPGRR 124 

Qy 784 STKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIID 843 

II I : i : I : : : III : I hhl 
Db 125 GDPDTAP PSERPLEDLRLRLDHLGWVARGT GSGED SSTSSSTPLE 169 

Qy 844 EFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEV 903 

I : I : I I I : I II II : I 

Db 170 DEEP QEPNRLETGEAGE EL — DLRLRLAQPSSPE 201 

Qy 904 HVSDEFSENRSSVSKASISP SNVSALEPQTE 934 

: : : I : : I I II III 

Db 202 VLTPQLSPGSGTPQAGTPSPSRSRDSNSGPEEPLLEEEEKQWGPLEREPVRGQCLDSTDQ 261 

Qy 935 MGSIVK— SKSLTKEAEKKLP S 954 

: I : : : II I : I 

Db 262 LEFTVEPRLLGTAMEWLKT SLLLAWKTVP I LELS P PLWTAI GWVQRGPT P PT PVLRVLL 321 

Qy 955 DTEKEDRSLSAVLSAELSK TS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVT 1011 

I I I I I I I : I I I I I I : I : : I I I I : III I I I I I I 

Db 322 KWAKSPRS-SGVPSLSLGADMGSKVADLLYWKDTRTSGWFTGLMVSLLCLLHFSIVSVA 380 

Qy 1012 AYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVN 1071

I::|| II I I I I : I : I : I I : : I :||:|||: :: :: I :: |: I 
Db 381 AH LAL L L L C GT I S LRVYRKVLQ AVH RG DGAN P FQAYL D VD LT LT REQT E RL S HQ ITS RW 440 

Qy 1072 STIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQV 1131 

I : I I llhlllllll I : I : : I : I I I : I I I I I I I I I : I II : I I : : I : I I 
Db 441 SAATQLRHFFLVEDLVDSLKLALLFYILTFVGAIFNGLTLLILGVIGLFTIPLLYRQHQA 500 

Qy 1132 QIDHYLGLANKSVKDAMAKIQAKIPG 1157 

1111:11 : I I I : I I I I I 

Db 501 QIDQYVGLVTNQLSHI KAKI RAKI PG 526 



RESULT 10 
PCLO_HUMAN 

ID PCLO_HUMAN STANDARD; PRT; 5147 AA. 

AC Q9Y6V0; O43373; O60305; Q9BVC8; Q9UIV2; Q9Y6U9;
DT 28-FEB-2003 (Rel. 41, Created) 
DT 28-FEB-2003 (Rel. 41, Last sequence update) 
DT 10-OCT-2003 (Rel. 42, Last annotation update) 
DE Piccolo protein (Aczonin) (Fragments).
GN PCLO OR ACZ OR KIAA0559.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE OF 1-759 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=10508862;
RA Wang X., Kibschull M., Laue M.M., Lichte B., Petrasch-Parwez E.,
RA Kilimann M.W.;

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin."; 

RL J. Cell Biol. 147:151-162(1999). 

RN [2] 

RP SEQUENCE OF 552-4404 FROM N.A. 

RA Kraemer J., Wollam C., Wohldmann P., McGrane B.;
RL Submitted (DEC-1999) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE OF 3619-5147 FROM N.A. (ISOFORM 2). 

RC TISSUE=Brain; 

RX MEDLINE=98290545; PubMed=9628581;
RA Nagase T., Ishikawa K.-I., Miyajima N., Tanaka A., Kotani H.,
RA Nomura N., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. IX. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res. 5:31-39(1998). 

RN [4] 

RP SEQUENCE OF 4405-4439 FROM N.A. 

RC TISSUE=Placenta;
RX MEDLINE=22388257; PubMed=12477932;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [5] 

RP SEQUENCE OF 4405-5147 FROM N.A. 

RA Kalicki J., Elliott G.;
RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: May act as a scaffolding protein involved in the 

CC organization of synaptic active zones and in synaptic vesicle 



CC trafficking (By similarity).
CC -!- SUBUNIT: Interacts with Rabac1/Pra1, RIMS2 and profilin (By
CC similarity).
CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of
CC synaptic junctions (By similarity).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2;
CC Comment=Additional isoforms seem to exist;
CC Name=1;
CC IsoId=Q9Y6V0-1; Sequence=Displayed;
CC Name=2;

CC IsoId=Q9Y6V0-2; Sequence=VSP_003923, VSP_003924, VSP_003925, 

CC VSP_003926, VSP_003927; 

CC Note=No experimental confirmation available; 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Y19188; CAB60727.1; -.
DR EMBL; AC004903; AAD20936.1; -.
DR EMBL; AC004886; AAD21789.1; -.
DR EMBL; AB011131; BAA25485.1; -.
DR EMBL; BC001304; AAH01304.1; -.
DR EMBL; AC004082; AAB97937.1; -.
DR PIR; T00634; T00634.
DR HSSP; P04410; 1A25.
DR Genew; HGNC:13406; PCLO.
DR MIM; 604918; -.
DR GO; GO:0005856; C:cytoskeleton; NAS.
DR GO; GO:0045202; C:synaptic junction; ISS.
DR GO; GO:0005509; F:calcium ion binding; ISS.
DR GO; GO:0005544; F:calcium-dependent phospholipid binding; ISS.
DR GO; GO:0005522; F:profilin binding; ISS.
DR GO; GO:0007010; P:cytoskeleton organization and biogenesis; ISS.
DR GO; GO:0016080; P:synaptic vesicle targeting; ISS.
DR InterPro; IPR000008; C2.
DR InterPro; IPR001565; Synaptotagmin.
DR PRINTS; PR00360; C2DOMAIN.
DR PRINTS; PR00399; SYNAPTOTAGMN.
DR SMART; SM00239; C2; 2.
DR PROSITE; PS00499; C2_DOMAIN_1; 1.
DR PROSITE; PS50004; C2_DOMAIN_2; 2.

KW Calcium/phospholipid-binding; Zinc; Metal-binding; Zinc-finger; 

KW Repeat; Alternative splicing. 

FT NON_TER 1 1

FT DOMAIN 400 465 10 X 10 AA TANDEM APPROXIMATE REPEATS OF 

FT P-A-K-P-Q-P-Q-Q-P-X. 



FT ZN_FING 499 523 C4-TYPE (POTENTIAL).
FT ZN_FING 969 992 C4-TYPE (POTENTIAL).
FT NON_CONS 1010 1011
FT DOMAIN 2300 2325 POLY-PRO.
FT DOMAIN 4391 4442 PDZ.
FT DOMAIN 4544 4633 C2 DOMAIN 1.
FT DOMAIN 5031 5121 C2 DOMAIN 2.
FT VARSPLIC 4404 4404 S -> SGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAE
FT QTGKLMEG (in isoform 2).
FT /FTId=VSP_003923.
FT VARSPLIC 4534 4534 K -> KPTDGTKWSHPITGEIQ (in isoform 2).
FT /FTId=VSP_003924.
FT VARSPLIC 4576 4576 G -> GQVMWQNAS (in isoform 2).
FT /FTId=VSP_003925.
FT VARSPLIC 4757 4761 TAHKS -> SKRRK (in isoform 2).
FT /FTId=VSP_003926.
FT VARSPLIC 4762 5147 Missing (in isoform 2).
FT /FTId=VSP_003927.
SQ SEQUENCE 5147 AA; 563537 MW; CD5D84990498CD3C CRC64;



Query Match 5.3%; Score 308.5; DB 1; Length 5147; 

Best Local Similarity 21.4%; Pred. No. 0.00076; 

Matches 264; Conservative 150; Mismatches 438; Indels 383; Gaps 63; 

Qy 3 DIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERKPA 62 

II: I I I I : I I : : I : : : I : I I 

Db 251 DIVRGESVKPSLPSPSKPP - IQQPTPGKPPAQQPGHEKSQPG PAKPP 296 

Qy 63 AGLSAAAVP PAAAAPLLDFSSDSVPPA-PRGPLPAAPPA A 101 

II i I I : : I I I I I I I I I I I 

Db 297 AQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLG— PAKPPAQQTGSEKPSSEQPGPKA 354 

Qy 102 PERQPSWERSPA-APAPSLP rPAAAVLPSKLPEDDEPPA 138 

: I : : I I I I : I Nihil : I I : 

Db 355 LAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPAQTKPPSQQP 414 

Qy 139 RP P P P P P AGAS P LAE PAAP P S T P AAP KRRG S G S VDET L FAL PAAS E P VI P S S AE KI M 195 

: I I I I I I I I I I :: I I I I : I : III: 
Db 415 GSTKPPPQQPGPAKP— SPQQPGSTKPPSQQPGS AKPSAQQ PSPAKPSA 4 61 

Qy 196 DLMEQPGNTVSSGQEDFPSVLLETAASLPS LSPL-STVSFKEH-GYLGNLSAVS 247 

: I : I : I : : I II : I I : I I I : : 

Db 4 62 QQFTKPVSQTGFGKPLQPPTVSPSAKQPPSQGLPKTICPLCNTTELLLHVPEKANFNTCT 521 

Qy 248 SSEGTI EETLNEASKEL . PERATNPFVNRDL 277 

: I : .111:1 I : I I 

Db 522 ECQTTVCSLCGFNPNPHLTEAKEWLCLNCQMKRALGGDLAPVPSSPQPKLKTAPVTTTSA 581 

Qy 278 AEFSELEYSEMGS S FKGS PKGESAI LVENTKEEVI VRSKDKEDLVCSAALHSPQES PVGK 337 

I : : : I I : : II : I : : | | : : | 

Db 582 VSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSK 641 

Qy 338 EDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVES 397 

•I I : :l III II I! :| III: I :: I : 

Db 642 P APPKEPSVPSEQDK APV ADDKP-KQPKMVKPTTD LVSSSSATTKP 68 6 

Qy 398 KVDRKCLEDSLEQKS LGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTT 454 



Db 687 DIPSSKVQSQAEEKTTPPLKTDSAKPSQ — SFPPTGEKV TPFDSKAIPRP 734 

Qy 455 ANTFPLLE — DHTSENKTD EKKIEERKAQIITEKTSPK 4 90 

I: : I I : I :|| i :||| I III 

Db 735 ASDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQ TKMSPKPDAKPMPKGSPTPPGP 791 

Qy 491 — TSNPFLVAVQDS EADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEAT 544 

I : : I I I : | : | : | | | | | | 
Db 792 RPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQ PTTPQET VT 838 

Qy 545 GTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSA 604 

I : : I | | : | | : | : :|| | |:: 

Db 839 GKLFGFGASI-FSQASNLISTAGQPG PHSQSGPGAP -MKQAPAPSQPPTS 886 

Qy 605 GASWQPSVSPLEAPP PVSYD SIKLEP ENPPPYEE 63 9 

I I : I I I II: : I I I I : I I I :: 

Db 8 87 QG PPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPIKD 942 

Qy 640 AMNVALK ALGTK EG I KE P ES FNAAVQ E 666 

= : : : IN I I : | : M : 

Db 943 SKSLTAEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGF 1002 

Qy 667 TEAPYISIAC DLIK ETKLSTEPSPDFSNYSEIAKFEKSVPEHA 709 

I::: I hi I I I I :| : : :| I 

Db 1003 NPTPHLTENCQTQRAISGQLGDIRKMPPAPSGPKAS PMPVP TESSSQKTAVPPQV 1057 

Qy 710 ELV EDSSPESEPV DLFSDDSIPEV PQTQEEAVMLMKESLTEVSETVA 756 

s I I : : hi I : I : I I : I I I h I I : : : I 

Db 1058 KLVKKQEQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQEES-KLEKDKASALQEKKP 1116 

Qy 757 QHKEERLSASPQEL GKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTA 812 

: h : I : : : I I I I : I I I : I I I : 
Db 1117 LPEEKKLIPEEEKIRSEEKKPLLEEKKPTPEDKK LLPEAKTSAPEEQ 1163

Qy 813 IYSNDDLLSSK EDKI KESETFSDSSPIEIIDEFPTFVSAKDDS- 855 

Mil: |:|: I : : I : I | : : | | 

Db 1164 KHDLLKSQVQIAEEKLEGRVAPKTVQEGKQPQTKMEGLP SGTPQSLPKEDDKT 1216 

Qy 856 PKLAKEYTDLEVSDKSEIANIQ SGADSLPCLELPCD 8 91 

III : I I I I : : : | | | | : | 

Db 1217 TKTIKEQPQPPCTAKPDQEKE-DDKSDTSSSQQPKSPQGLSDTGYSSDGISSSLGEIP — 1273 

Qy 892 LSFKNIYPKDEVHV SDEFSENRSSVSK ASISPSNVSALEPQTEMGSIVKSKS 943 

— III : 111:11 I : : : I I I I : I h 

Db 1274 SLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSEK 1329 

Qy 94 4 LTK — EAEKKLPSDTEKEDRSLSAVLSAELSKTSV 976 

I : I : I I II : I I I I : h : 
Db 1330 KTQPHEVSPEQPKDQEK-TQSLSETLEITISEEEI 1363 

RESULT 11 
PCLO_CHICK 

ID PCLO_CHICK STANDARD; PRT; 5120 AA. 

AC Q9PU36; 

DT 28-FEB-2003 (Rel. 41, Created) 



DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Piccolo protein (Aczonin) (Fragment).

GN PCLO OR ACZ. 

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus. 

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=10508862;

RA Wang X., Kibschull M., Laue M.M., Lichte B., Petrasch-Parwez E.,

RA Kilimann M.W.;

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin."; 

RL J. Cell Biol. 147:151-162(1999). 

CC -!- FUNCTION: May act as a scaffolding protein involved in the
CC organization of synaptic active zones and in synaptic vesicle

CC trafficking (By similarity).

CC -!- SUBUNIT: Interacts with Rabac1/Pra1 and profilin (By similarity).

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of
CC synaptic junctions (By similarity).

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change.

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Y19187; CAB60725.1; -. 

DR HSSP; P04410; 1A25. 

DR GO; GO:0045202; C:synaptic junction; ISS. 

DR GO; GO:0005509; F:calcium ion binding; ISS.

DR GO; GO:0005544; F:calcium-dependent phospholipid binding; ISS.

DR GO; GO:0005522; F:profilin binding; ISS.

DR GO; GO:0007010; P:cytoskeleton organization and biogenesis; ISS.

DR GO; GO:0016080; P:synaptic vesicle targeting; ISS.

DR InterPro; IPR000008; C2.

DR InterPro; IPR001478; PDZ.

DR InterPro; IPR001565; Synaptotagmin.

DR InterPro; IPR008899; Znf_piccolo. 

DR Pfam; PF00168; C2; 2. 

DR Pfam; PF00595; PDZ; 1. 

DR Pfam; PF05715; Zf_piccolo; 2. 

DR PRINTS; PR00399; SYNAPTOTAGMN.

DR SMART; SM00239; C2; 2.

DR SMART; SM00228; PDZ; 1. 



DR PROSITE; PS00499; C2_DOMAIN_1; 1.
DR PROSITE; PS50004; C2_DOMAIN_2; 2.
DR PROSITE; PS50106; PDZ; 1.




KW Calcium/phospholipid-binding; Zinc; Metal-binding; Zinc-finger;
KW Repeat.








FT NON_TER 1 1
FT DOMAIN 258 357 10 X 10 AA TANDEM APPROXIMATE REPEATS OF
FT P-A-K-P-Q-P-Q-Q-P-X.
FT ZN_FING 368 392 C4-TYPE (POTENTIAL).
FT ZN_FING 836 859 C4-TYPE (POTENTIAL).
FT DOMAIN 2324 2343 POLY-PRO.
FT DOMAIN 4414 4493 PDZ.
FT DOMAIN 4627 4726 C2 DOMAIN 1.
FT DOMAIN 5003 5094 C2 DOMAIN 2.


SQ SEQUENCE 5120 AA; 560751 MW; A658D9891B65B412 CRC64;


Query Match 5.1%; Score 300; DB 1; Length 5120;



Best Local Similarity 20.4%; Pred. No. 0.0016; 

Matches 258; Conservative 152; Mismatches 464; Indels 388; Gaps 56;

Qy 14 TDSPPRPPPAFKYQFVTEPEDEEDEEE EEDEEEDDEDLEELEVLERKPAAGLSAA 68

: I I I : I I I : I : I I : ::::::: : : | | : 

Db 90 SDSDAAHEEAGRKQKVTQKEQGKPEEQRGLAKHPSQQQSPKLVQQQGPVKPTPQQTESSK 149 

Qy 69 AVP PAAAAPLLDFSSDSV — PPAPRGP LPAAPPAAPERQPSWE 109 

M I : I I :: I I I I I I I I : I I : I : 

Db 150 PVPQQQQPGEPKQGQKPGPSHP-GDSKAEQVKQPPQPRGPQKSQLQQSEPTKPGQQQTSA 2 08 

Qy 110 RSPAAPAPSLP — PAAAVLPSKLPEDDEP PAR P 14 0 

: : I I I I I : I hi : I III I 

Db 209 KTSAGPTKPLPQQPDSAKTSSQAPPPTKPSLQQSGSVKQPSQQPARQGGPVKPSAQQAGP 268 

Qy 141 P P P P P AGAS P LAE P AAP P S T PAAP KRRG S GS VDET — LFALPAASEPVIPSS — AEKIMD 196 

I I II: I II I : : I : : | : | | M I 

Db 269 PKQQPGSEKPTAQQTGPAKQPPQP-GPGKTPLQQTGPVKQVPPQAGPTKPSSQTAGAAKS 327 

Qy 197 LMEQPGNTVS SGQEDFPSVLLETAAS LPSLS PLSTVS FKEHGYLGNLSAVS S SEGTI EET 256 

I : I I I I I I : I | : : | I I : : I 

Db 32 8 LAQQPGLTKPPGQQPGPEKPLQQKQA STTQPVESTPKKT 3 66 

Qy 257 L --NEASKELPERAT NPFVNRDLAEFSE LEYS 286 

I I I : I I I I I I . : : : 

Db 367 FCPLCTTTELLLHTPEKANYNTCTQCHTWCSLCGFNP — NPHITEIKEWLCLNCQMQRA 424 

Qy 287 EMGSSFKGSPKGESAI LVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPE- 345 

I II : I I I I III I |:| I : 

Db 425 LGGDLASGHGPGPQ LPPPKQKTPTPASTAKPSPQLQPGQKKDASPKPDP 4 73 

Qy 346 KTMD I FNEMQMS WAP VRE E YAD FK P FEQAWE VKDT YEGS RDVLAARAN VE S K 398 

I : : I II:: : I : : : I : I I : | 
Db 474 SQQADSKKPVPQKKQPSMPGSPPVKSKQTHAEPSDTGQQI-DSTPKSDQVKPTQA 527 

Qy 399 VDRKCLEDSLEQKSLGKD SEGRNEDASFPSTPEPVKDSSRAYITCAS FTSA 449

I : I I : I : I : I : I : I I : : : I : 

Db 528 EEKQNQP S I QKPTMDTVPT SAAPGVKQDLADPQS P STQQKVTDSPMPET 576 



Qy 450 TESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSP KTSNPFLV 497



I : | : | : : | | | : I : : | : | III 

Db 577 TK — PPADTHPAGDKPDSKPL PQVSRQKSDPKLASQSGAKSDAKTQKPSEP 625 

Qy 498 A-VQDSEADYVT TDTLSKVTEAAVSNMPE 525 

I 1:1 I I : I | : |: 

Db 62 6 APVKDDPKKLQTKPAPKPDTKPAPKGPQAGTGPRPTSAQPAPQPQQPQKTPEQSRRFSLN 685 

Qy 526 — GLT PDLVQEACESEL NEA TGTKIAYETKVDLVQTSEAIQ 564 

hi III :| ::| I : :| : : I 

Db 686 LGGITDAPKPQPTTPQETVTGKLFGFGASIFSQASSLISTAGQPGSQTSGPAPPATKQPQ 745 

Qy 565 ESLYPTAQLCPSFEEAEATPSP-VLPDIVMEAPLNS--LLPSAGASWQPSVSPLEAPPP 621 

II 111:11 I I I I I I I I : I I I I 

Db 74 6 PPSQPPASQAPPKEAAQAQPPPKAAPTKKETKPLASEKLGPMASDSTLTTKGSDLEKKPS 805 

Qy 622 VSYDSIKLEPENPPPYE EAMNVALKALGTKEGIKEPESFNAA 663 

:: I I I II :: I I I I : I : I I 

Db 806 LAKDSKHQTAEAKKPAELSEQEKASQPKVSCPLCKTGLNIGSKDPPNFNTCTECKKWCN 865 

Qy 664 VQETEAP YI S IAC DLIK 680 

III :: : I I: I 

Db 866 LCGFNPMPHIVEVQE WLCLNCQTQRAMSGQLGDMGKVPLPKLGPSQPVSKPPATPQ 921 

Qy 681 — ETKLSTEPSPDFSNYSEIAKFEKSVP — EHAEL VEDSS 716 

I I I I : I : I I I I : : I : : 

Db 922 KQPVPAVSHSPQKSSTPPTPAATKPKEEPSVPKEVPKLQQGKLEKTLSADKIQQGIQKED 981 

Qy 717 PESEPVDLF SDDSIPEVPQTQEEAVMLMKESLTEVSETVAQH-KEERLSASPQELGK 772 

: I : II III II:: : I : I I : : : : I : : | | 

Db 982 AKSKQGKLFKTPSADKIQRVSQKEDSRLQQTKLTKTPSSDKILHGVQKEDIKFQEAKLAK 1041 

Qy 773 PYLES FQPNLHSTKDA ASNDIPTLTKKEKISLQMEEFNTAIYSN 816 

I: M II ::: I :|| : II : |: :: 

Db 1042 IPSADKILHRLQKEDPKLQQMKM7UCALSADKIQPEAQKEDVQLQEVRLSKAVSADKIQHG 1101 

Qy 817 —DDL LSSKEDKI KESETFSDSSPIEII-DEFPTFVSA — 851 

II : I : I I Ml: I I : I : I I I : 

Db 1102 IQKDLNLQHVKI EKTS SVEKIQEAQKESKLQQDKLPKTLS EDKI PATVS SDHKKLLSKSE 1161 

Qy 852 KDDSPKLAKEYTDLEVSDKSEIANIQSG — ADSLPCLELPCDLSFKNIYPKDEVHVSDE- 908 

: I I : I : : I I : I : I : : I I I I : : I : | | 

Db 1162 EDKKPELLEKSTPHPKDKKEQITAETTGHITEQKVEVEAPCD KLHEKKQEDVKKED 1217 

Qy 909 FSENRSSVSKA— SISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRS 962 

: I II :| II I I::: |: | : || :| ||| : 
Db 1218 LTTGIPQMVSKPEKAEEEKTPVPVSRL-PRSDHVEAVREK-IEKEDDK SDTSSSQQQ 1272 

Qy 963 LS 964 

I 

Db 1273 KS 1274 



RESULT 12 
CPN_DROME 

ID CPN_DROME STANDARD; PRT; 865 AA. 

AC Q02910; 

DT 01-OCT-1993 (Rel. 27, Created) 



DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Calphotin. 

GN CPN OR CAP. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;

OC Ephydroidea; Drosophilidae; Drosophila.

OX NCBI_TaxID=7227; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S; 

RX MEDLINE=93165729; PubMed=8094559;

RA Martin J.H., Benzer S., Rudnicka M., Miller C.A.;

RT "Calphotin: a Drosophila photoreceptor cell calcium-binding protein."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1531-1535(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S; 

RX MEDLINE=93165730; PubMed=8434015;

RA Ballinger D.G., Xue N., Harshman K.D.; 

RT "A Drosophila photoreceptor cell-specific protein, calphotin, binds 

RT calcium and contains a leucine zipper."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1536-1540(1993). 

CC -!- FUNCTION: Might function as a calcium-sequestering "sponge" to 
CC regulate the amount of free cytoplasmic calcium. It binds 0.3 mole 

CC of Ca(2+) per mole of protein.

CC -!- SUBUNIT: Homodimer (Probable). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic; hypodense compartment. 

CC -!- TISSUE SPECIFICITY: Soma and axons of photoreceptor cells of 
CC compound eyes and ocelli. 

CC -!- DEVELOPMENTAL STAGE: Expressed early in photoreceptor cell 
CC development. 

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; L02111; AAA28405.1; -. 

DR EMBL; L05080; AAA28420.1; -. 

DR PIR; A47282; A47282.

DR PIR; A47283; A47283. 

DR FlyBase; FBgn0010218; Cpn. 

DR GO; GO:0005509; F:calcium ion binding; IDA.

KW Calcium-binding. 



FT CONFLICT 36 36 A -> AVAPAWA (IN REF
FT CONFLICT 43 43 I -> T (IN REF. 2).
FT CONFLICT 64 64 I -> V (IN REF. 2).
FT CONFLICT 76 76 T -> A (IN REF. 2).
FT CONFLICT 100 100 P -> PP (IN REF. 2).
FT CONFLICT 126 127 VQ -> AP (IN REF. 2).
FT CONFLICT 154 154 I -> V (IN REF. 2).
FT CONFLICT 160 160 S -> T (IN REF. 2).
FT CONFLICT 534 534 A -> E (IN REF. 2).
FT CONFLICT 699 699 I -> T (IN REF. 2).
FT CONFLICT 703 703 V -> L (IN REF. 2).
FT CONFLICT 721 721 D -> E (IN REF. 2).


SQ SEQUENCE 865 AA; 847* 31 MW; 2110417E0B0E7CFE CRC64;



Query Match 5.1%; Score 295.5; DB 1; Length 865; 

Best Local Similarity 21.3%; Pred. No. 0.00024; 

Matches 217; Conservative 127; Mismatches 379; Indels 295; Gaps 41; 

Qy 62 AAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPER QPSWERSPAAPAP 117 

: I : : I Mill: I Mill I : III I I : I I I II 

Db 11 S AP VAAP VT P S AVAAP VQ WS P AAVAP APAAP I AVT P VAP PPT LAS VQ PAT VT I P - APAP 6 9 

Qy 118 SLPPAAAVLPSKLPEDDEPPARPPPPPPAGA SPLAEPAAPPSTP 161 

I : I I : I I : III III : I : I I I II 

Db 70 IAAASVTPVASVAPPWAAPTPPAASPVSTPVAVAQIPVAVSAPVAPPVAATPTPWQIP 129 

Qy 162 -AAPKRRGSGSVDETLFALP — AASEP VI PS SAEKIMDLMEQPGNT VSSGQED 211 

III : I I I I I I I I : I III I : : 

Db 130 VAAP VIATPPVAASAPTPAAWPVISPVIASPPWPANTTVPVAAPVAA 178 

Qy 212 FPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATN 270 

I : : I I I ::: I I :: : : I I I : I : I I 

Db 179 VPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEIPVA — TIPECVAPLIPEVSWATK 236 

Qy 271 P FVNRDLAEFS ELE YS EMGS S FKGS PKGES AI LVENTKEEVI VRS KDKEDLVC SAALHS P 330 

I I I : I I I : I I I 

Db 237 PLA AAE PVWAP PAT ET P WAPAAAS P 263 

Qy 331 QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLA 390 

I I : I : : : | | | | I 

Db 2 64 HVS VAP AVETAWAPV 279 

Qy 391 ARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYITCASF 446 

1:11 I : I : I : I : I I I I : : : : 

Db 280 - S AS T E P P VAAAT LT TAP ET P ALAP WAE S Q VAANT WAT PPT PAP E P ET I AP PWAET P 338 

Qy 447 TSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADY 506 

I : : II:: : I : I I I : I I I : 
Db 339 EVASVAVAETTPPWPPVAAES 1 PAP WAT T P VP AT LAVT D P D 381 

Qy 507 VTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES 566 

11:11:1:111 I I I I I 
Db 382 VT AS AVP EL P P VI AP S P VP S A VAETPVDLA PPV 414 

Qy 567 LYP-TAQLCPSF — EEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAP 619 

II I : I : II I I : I : : I I : : I : I I : I I I I 

Db 415 L P PVAAE PVPAWAEET P ET PAPAS APVT I - AALD — I PEVAPVI AAP S DAPAEAP SAAA 471 

Qy 620 PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD-- 677 

III II 1:11111::: 

Db 472 PIVS TPPT TASVPETTAPPAAVPTEPI 498 

Qy 678 LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE — LVEDSSPESEPVDLFSD-DSIP 731 

: : I : I : I : I : I : : I I I : : I I : II I : I 

Db 4 99 DVS VLS EAAI ET PVAP PVEVTTEVAVADVAP PEAAADL 1 1 E PVE P PAP I PDLL EQTT SVP 558 



Qy 732 EVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASN 7 91 

I : : : : I I : I I I :: : I : I I : I : I : : 

Db 559 AVEAAESTSSPIPETSLPPPNEAVA SPEVAVAPITAPEPI PEP-EPSLATPTEPI PV 614 



Qy 792 DIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSA 851 

: I : : I : : I : : I I I I I I : 

Db 615 EAPV VI Q EAVDAVEVP VT ET S T S I P ET T VE F P EAVAE 651 



Qy 852 KDDSPKLAKEYTDLEV-SDKSEIANIQSGAD SLPCLEL PCDLSFKNIYP- 8 99 

I I : | : | : : : : | | | | | : I : I : I : : II 

Db 652 KVLDPAI TEAPVTTQEPDVANINDGAPATEITTPAVEIVTAAAEVSDIAI PVIDPP 7 07 

Qy 900 -KDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMG — 936 

I : I : I I : : : : I : I I : 

Db 708 VPQE I AVA- E I P ET DTKPAEVI VEQ S T I P I EAPVP EVS K YAE P VI S EAP AAEVP I TAGDN 766 

Qy 937 SIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGV 990 

I : I : : III: I I : I : : I I *| I : : I I 

Db 767 PDNTSVGISEWPTIAEKPVEEVPTSEIPEQSSSPSDSVPVAKITPLL — RDLQTTDV 822 



RESULT 13 
MAPB_RAT 

ID MAPB_RAT STANDARD; PRT; 2459 AA. 

AC P15205; Q62958; Q9ER21; Q9QW92; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Microtubule-associated protein 1B (MAP 1B) (Neuraxin) [Contains: MAP1

DE light chain LC1].

GN MAP1B. 

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1]

RP SEQUENCE OF 1-142 FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Testis;

RX MEDLINE=96257242; PubMed=8666295;

RA Liu D., Fischer I.; 

RT "Isolation and sequencing of the 5' end of the rat microtubule-

RT associated protein (MAP1B)-encoding cDNA.";

RL Gene 172:307-308(1996). 

RN [2] 

RP SEQUENCE OF 96-2459 FROM N.A., DOMAIN, AND INDUCTION.

RC STRAIN=Sprague-Dawley; TISSUE=Brain, and Glial tumor; 

RX MEDLINE=92347374; PubMed=1639092;

RA Zauner W., Kratz J., Staunton J., Feick P., Wiche G.; 

RT "Identification of two distinct microtubule binding domains on 

RT recombinant rat MAP 1B.";

RL Eur. J. Cell Biol. 57:66-74(1992). 

RN [3] 

RP SEQUENCE OF 1541-2459 FROM N.A., AND TISSUE SPECIFICITY.

RC TISSUE=Spinal cord;

RX MEDLINE=90059871; PubMed=2555150;

RA Rienitz A., Grenningloh G., Hermans-Borgmeyer I., Kirsch J., 



RA Littauer U.Z., Prior P., Gundelfinger E.D., Schmitt B., Betz H.;

RT "Neuraxin, a novel putative structural protein of the rat central 

RT nervous system that is immunologically related to microtubule- 

RT associated protein 5."; 

RL EMBO J. 8:2879-2888(1989). 
RN [4] 

RP DEVELOPMENTAL STAGE, AND PHOSPHORYLATION. 

RX MEDLINE=97405699; PubMed=9260743;

RA Ma D., Nothias F., Boyne L.J., Fischer I.;

RT "Differential regulation of microtubule-associated protein 1B (MAP1B)

RT in rat CNS and PNS during development."; 

RL J. Neurosci. Res. 49:319-332(1997). 

CC -!- FUNCTION: The function of brain MAPs is essentially unknown.
CC Phosphorylated MAP 1B may play a role in the cytoskeletal changes

CC that accompany neurite extension. Possibly MAP1B binds to at least

CC two tubulin subunits in the polymer, and this bridging of subunits

CC might be involved in nucleating microtubule polymerization and in

CC stabilizing microtubules.

CC -!- SUBUNIT: 3 different light chains, LC1, LC2 and LC3, can associate
CC with MAP1A and MAP1B proteins.

CC -!- TISSUE SPECIFICITY: Nervous system (spinal cord, brain stem, 

CC cerebellum and cerebrum). Not expressed in liver, spleen, kidney,

CC heart or muscle. 

CC -!- DEVELOPMENTAL STAGE: In cerebral cortex, spinal cord and sciatic 
CC nerve levels are high early in development but decrease during 

CC postnatal development and are low in adults. In dorsal root 

CC ganglia levels remain high throughout development. 

CC -!- INDUCTION: By nerve growth factor. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 
CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP 1B to microtubules.

CC -!- PTM: LC1 is coexpressed with MAP1B. It is a polypeptide generated
CC from MAP 1B by proteolytic processing. It is free to associate with

CC both MAP1A and MAP 1B. It interacts with the amino-terminal region

CC of MAP 1B (By similarity).

CC -!- PTM: Phosphorylated.

CC -!- SIMILARITY: TO MAP1A.

CC -!- CAUTION: A C-terminal fragment of this protein (residues 1597 to 
CC 2459) was originally described as neuraxin in Ref.3. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U52950; AAB17068.1; -. 

DR EMBL; X60370; CAC16162.1; -. 

DR EMBL; X16623; CAA34620.1; ALT_SEQ.

DR PIR; A56577; A56577. 

DR InterPro; IPR000102; MAP1B_neuraxin.

DR Pfam; PF00414; MAP1B_neuraxin; 10.

DR PROSITE; PS00230; MAP1B_NEURAXIN; 8. 

KW Microtubule; Repeat; Phosphorylation. 

FT CHAIN ? 2459 MAP1 LIGHT CHAIN LC1.



FT REPEAT 1869 1885 MAP1B 1.
FT REPEAT 1886 1902 MAP1B 2.
FT REPEAT 1903 1919 MAP1B 3.
FT REPEAT 1920 1936 MAP1B 4.
FT REPEAT 1937 1953 MAP1B 5.
FT REPEAT 1954 1970 MAP1B 6.
FT REPEAT 1988 2004 MAP1B 7.
FT REPEAT 2005 2021 MAP1B 8.
FT REPEAT 2022 2038 MAP1B 9.
FT REPEAT 2039 2055 MAP1B 10.
FT DOMAIN 559 1035 GLU-RICH.
FT DOMAIN 588 786 LYS-RICH (HIGHLY BASIC, CONTAINS MANY
FT KKEE AND KKEI/V REPEATS).
FT DOMAIN 2224 c. O _L c~
FT CONFLICT 127 127 M -> V (IN REF. 1).
FT CONFLICT 140 140 T -> S (IN REF. 1).
FT CONFLICT 2112 2112 R -> K (IN REF. 3).
FT CONFLICT 2169 2169 L -> I (IN REF. 3).


SQ SEQUENCE 2459 AA; 269497 MW; 2E3F6872DEDB8BA2 CRC64;


Query Match 5.0%; Score 291.5; DB 1; Length 2459;



Best Local Similarity 20.0%; Pred. No. 0.0013; 
Matches 220; Conservative 167; Mismatches 456; Indels 255; Gaps 44;

Qy 30 TEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPA 89 

: I I I I : I : : I I I : I : : : I I I I III:: I : 
Db 1008 SEEEGEEEEDKAEDAREEDHEPDKTE AEDYVMAWDKAAEAGVTEDQYDFL 1058

Qy 90 PRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGAS 149

1:1 I : I I II::: II I I I I 

Db 1059 GTPAKQ- PGVQSPSREPASSIHDETLPGGSESEAT AS 1094 

Qy 150 P LAE P AAP P S T P AAP K RRG S G S VD ET L FAL P AAS E P VI P S S AE K I MD LMEQ P GN TVS 206 

I I II I : : I I I I : I III 

Db 1095 DEENREDQPEEFTAT SGYTQST 1 EI S SEPTPMDEMSTPRDVMTDETNNEETES 1147 

Qy 207 SGQE DFPSVLLETAASLP SLSPLS TVS FKEHGYLGNLSAVS S S EGTI E 254 

II : I I II I : II I : I : I : I I 

Db 1148 PSQEFVNITKYESSLYSQEYSKPWAS FNGLSDGSKTDATDGRDYNASASTISPPSSMEE 1207 

Qy 255 ETLNEAS KELPERATNPFVNRDLAEF— SELEYSEMG 289 

: : : : : I : : : : I : I : I : I : : I 

Db 1208 DKFSKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERS 1267 

Qy 290 S S FKGS PKGESAI LVENTKEEVI VR SKDKEDL VCSAALHSP 330 

: I I : I I : : I : : I I : : : | I : I | : | 

Db 1268 WFSLTPNEIKASAEGEATAWSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTP 1327 

Qy 331 -QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFK PFEQAWEVKD 380 

:M :: : I I : : I I |: : I I :: II 

Db 1328 YYQSPTDEKSSHLPTEVT ENAQAVPVS FEFTEAKDENERS SIS PMDE — PVPD 1378 

Qy 381 TYEGSRDVLAARANVESKVDRKCLED — SLEQKSLGKDS EGRNEDASFPSTPEPVK 434 

• II.:: II I : | : | | : | I I : I I I I 

Db 1379 SESPIEKVLSPLRSPPLIGSE-SAYEDFLSADDKALGRRSESPFEGKNGKQGFSDKESPVS 1438 



Qy 435 DSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT DEKKI — 475



Db 1439 D LTSDLYQDKQEEKRAGFIPIKEDFSPEKKASDAEIMSSQSALALDERKLGGDGS 1493 

Qy 476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAV — 520 

I: I I I I:: | :|: I : :: |: |:| 
Db 1494 PTQVDVSQFGS FKEDTKMS I S EGTVS DKSAT PVDEGAEDT Y S HMEGVAS VS TAS VAT 1550 

Qy 521 SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES-LYPTAQLCP S 576 

1:11 Ml : : I : I : : I : I I I I I : : I : : I I I 

Db 1551 SSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMS 1609 

Qy 577 FEEAEATP SPVLPDIVMEAPLNSLL -PSAGASWQ 610 

: : I I : : I : I : I I I : I I :: 

Db 1610 ISPPDFSPKTAKSRT PVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHPTVGAGMLH 1669 

Qy 611 PSVSPLEAPPPVSYD S I KLEPENPPP YEEAMNVALKALGTKEGI KEPES FN 661 

: : III I I : I I I : : : : : : : I : : 

Db 1670 ITEN GPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLS-ELISVSQVEASPSTSS 1725 

Qy 662 AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 721 

I I I I : : I I I I I : : I : I : I II : I : 

Db 1726 AHTPS QIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKL SPKSDI 1775 

Qy 722 VDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQP 780 

I : I I : : I I I I : I : : :| I I : : I 

Db 1776 SPLTPRESSPTYSPGFSDSTSG7VKES-TAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQH 1834 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

:| ::| :: : :|: :|| I : | ::: : | | |: 

Db 1835 HLALSRDLTTSSV— — EKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYE — SHEKTIQ 1888 

Qy 841 IIDEFPTFVSAKDDSPK LAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKN 896 

I : : : I I : I : I : : I I I ::: I 

Db 1889 AHDVGGYYYEKTERTIKSPCDSGYSYETIEKTTKTP EDGGYS CEITEKT 1937 

Qy 897 IYPKDEVHVSDEFSENRSSVSKAS ISPSNVSALEPQTEMGSIVKSKSL 944 

: I I I I I : : I : : : : | : | 

Db 1938 TRTPEEGGYSYEISEKTTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSY 1997 

Qy 945 TKEAEKKLPSDTEKEDRS 962 

: I : I : I II I 
Db 1998 SYETTEKITSFPESESYS 2015 



RESULT 14 
MAPB_HUMAN 

ID MAPB_HUMAN STANDARD; PRT; 2468 AA. 

AC P46821; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Microtubule-associated protein 1B (MAP 1B) [Contains: MAP1 light chain
DE LC1].
GN MAP1B. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A.

RC TISSUE=Fetal brain; 

RX MEDLINE=95104835; PubMed=7806212;

RA Lien L.L., Feener C., Fischbach N., Kunkel L.M.;

RT "Cloning of human microtubule-associated protein 1B and the

RT identification of a related gene on chromosome 15."; 

RL Genomics 22:273-280(1994). 

CC -!- FUNCTION: The function of brain MAPs is essentially unknown.
CC Phosphorylated MAP 1B may play a role in the cytoskeletal changes

CC that accompany neurite extension. Possibly MAP1B binds to at least

CC two tubulin subunits in the polymer, and this bridging of subunits

CC might be involved in nucleating microtubule polymerization and in

CC stabilizing microtubules.

CC -!- SUBUNIT: 3 different light chains, LC1, LC2 and LC3, can associate
CC with MAP1A and MAP 1B proteins.

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 
CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP 1B to microtubules.

CC -!- PTM: LC1 is coexpressed with MAP1B. It is a polypeptide generated
CC from MAP1B by proteolytic processing. It is free to associate with

CC both MAP1A and MAP1B. It interacts with the amino-terminal region

CC of MAP1B (By similarity).

CC -!- SIMILARITY: TO MAP1A. 

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; L06237; AAA18904.1; -.

DR Genew; HGNC:6836; MAP1B.

DR MIM; 157129; -.

DR GO; GO:0005875; C:microtubule associated complex; TAS.

DR InterPro; IPR000102; MAP1B_neuraxin.

DR Pfam; PF00414; MAP1B_neuraxin; 10.

DR PROSITE; PS00230; MAP1B_NEURAXIN; 6. 

KW Microtubule; Repeat; Phosphorylation.

FT REPEAT 1878 1894 MAP1B 1.
FT REPEAT 1895 1911 MAP1B 2.
FT REPEAT 1912 1928 MAP1B 3.
FT REPEAT 1929 1945 MAP1B 4.
FT REPEAT 1946 1962 MAP1B 5.
FT REPEAT 1963 1979 MAP1B 6.
FT REPEAT 1997 2013 MAP1B 7.
FT REPEAT 2014 2030 MAP1B 8.
FT REPEAT 2031 2047 MAP1B 9.
FT REPEAT 2048 2064 MAP1B 10.
FT DOMAIN 589 790 LYS-RICH (HIGHLY BASIC, CONTAINS
FT KKEE AND KKEI/V REPEATS).


SQ SEQUENCE 2468 AA; 270618 MW; 540839CBDF09D461 CRC64;



Query Match 4.9%; Score 288; DB 1; Length 2468; 

Best Local Similarity 20.2%; Pred. No. 0.0017; 

Matches 251; Conservative 166; Mismatches 441; Indels 386; Gaps 56;

Qy 13 STDSPPR — PPPAFKYQFVTEPEDEEDEEE EED EEEDDED 50 

: I I I : I : : I I I : : : I : I : I I II I 

Db 625 ATDVKPKAAKEKTVKKETKVKPEDKKEEKEKPKKEVAKKEDKTPIKKEEKPKKEEVKKEV 684 

Qy 51 LEELEVLERKPAAGLSAAAVP PAAAAPLLDFS S DS VP PAPRGPLPAAP PAAP ERQP SWER 110 

: I :: | : | | : :: | : : ::: / | : 
Db 685 KKEIKKEEKK EPKKEVKKETPPKEVKKEVKKEEKKEVKKE EK 72 6 

Qy 111 SPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRG-- 168

I III : I I : I I I I I : I I I : I 

Db 727 EPKKEIKKLPKDAKKSSTPLSEAKKPi^KPKVPKKEESVKKDSVA AGKPKEKGKI 7 82 

Qy 169 SGSVDETLFAL PA AS E P VI P S S AE K I MDLME 199 

I I : I II | :: | : | | - 

Db 783 KVI KKEGKAAEAVAAAVGTGATTAAVMAAAGIAAI GPAKELEAERS LMS S PEDLTKDFEE 842 

Qy 200 QPGNTVS SGQEDFPSV- LLETAASLPSLS PLSTVS FKEHGYLGNLSAVS S SEGTI EETLN 258 

I : : | : | : | I | : : : : I I I I I I 

Db 843 LKAEEVDVTKDIKPQLELIEDEEKLKETEPVEAYVIQKEREVTKGPAESPDEG-TTTTEG 901

Qy 259 EAS KELPERATN P FVNRDLAEFS ELEYS EMGS S FKGS PKGESAI LVENT K- EEVI VRS KD 317 

I I I : : : : I : | : I : I | : I : I I : I 

Db 902 EGECEQTPEELEPVEKQGVDDIEKFE — DEGAGFEES — SETGDYEEKAETEEAEEPEED 957

Qy 318 KEDLVC- SAALHS P QESPVGKEDRWSPEKTMDIFNEMQMSV 358 

I : I I I I : I I I : I I : I I I : I I I 
Db 958 GEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRA EEDMD EAIEKG 1011 

Qy 359 VAPVREEYADFKPFEQAWEVK-DTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410

I | | | | : : : I : : : I I : : I I : I I : I : 

Db 1012 EAEQSEEEADEE — DKAEDAREEEYEPEK— MEAEDYVMAWDKAAEAGGAEEQYGFLTT 1067 

Qy 411 — KSLGKDSEGR NEDASFPSTPEPVKDSSRAYITCASFTSATESTT 454 

I I I I I I : | : | | : | : : : | : | : 

Db 1068 PTKQLGAQSPGREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEIS 1127 

Qy 455 ANTFPL LEDHTSENKTDEK KIEERKAQIITEKTS-PKTSNPFLVAV 499

: I : : | | : : | : | ::::::: | | | 

Db 1128 SEPTPMDEMSTPRDVMSDETNNEETESPSQEFWITKYESSLYSQEYSKPADVTPLNGFS 1187 

Qy 500 QDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDL 556 

: I : I II | : : | : | : I I :| ::: I : I : 
Db 1188 EGSKTD — ATDGKDYNASASTISPPSSMEED KFSRSALRDAYCSEVKASTTLDIKDS 1242 

Qy 557 — VQT S EAI QES L YPTAQLC P SFE EAEATPSPVLPDIV 592 

: I I : I I : I : I I I I : I I I : : 

Db 1243 ISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFSLTPNEIKVSAEAEVAPVSPEVT 1302 

Qy 593 ME APLNSLLPSAG ASWQ 610 

I : | | : | M I : : 

Db 1303 QEWEEHCAS PEDKTLEWS PSQSVTGSAGHTPYYQS PTDEKS SHLPTEVI EKPPAVPVS 1362 

Qy 611 PSVSPLEAPPPVSYDSIK — LEPENPPP YEEAMNVALKALG 64 9 



II I I : : I I I I : II II II : : III 

Db 1363 FEFSDAKDENERASVS PMDEPVPDSESPIEKVLSPLRSPPLIGSESAYESFLSADDKASG 1422 

Qy 650 TKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPD-FSNYSEIA KF 701 

: I : I : II : I I I I I I : I 

Db 1423 RGAESPF EEKSGKQGSPDQVSPVSEMTSTSLYQDKQ 1458 

Qy 702 EKSVPEHAELVEDSSPE SEPVDLFSDDSIPEVPQTQ EEAV 741 

I : | : | | | | : | : : : | | | I : 

Db 1459 EGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPTQIDVSQFGSFKEDTK 1518 

Qy 742 MLMKE SLTEVSETVAQ HKEERLSASPQELG- KPYLESFQPNLHSTK 786 

I : I I I I I I I : II II : : I : I : II 
Db 1519 MS I S EGTVS DKSAT PVDEGVAEDT YS HMEGVAS VSTAS VAT S S FPEPTTDDVS P S LH 1575 

Qy 787 DAASNDIPTLTK-KEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEF 845 

I | | : : : | : : : | : : : I I I : : 
Db 1576 — AEVGSPHSTEVDDSLSVSWQTPTT-FQETEMSPSKEECPR 1615 

Qy 846 PTFVSAKDDSPKLAKEYTDLE — VSDKSEIANIQSGADSLPCLELPCDLSFKN 896 

I :l I III II I :: I:: I : :|: I : J I I I I :: 
Db 1616 PMSISPPDFSPKTAKSRTPVQDHRSEQSSM-SIEFGQES-PEQSLAMDFSRQSPDHPTVG 1673 

Qy 897 IYPKDE-VHVSDEFSENRSSVSKASISPSNV 926 

I I : I : I III: III 

Db 1674 AGVLHITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTS 1733 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE 970 

I I ■ I : : : I : : : I : I : I I I I I : : I 

Db 1734 SAHTP-SQIASPLQEDTLSDVAPPR DMSLYASLTSE 1768 



RESULT 15 

MAPB_MOUSE 

ID MAPB_MOUSE STANDARD; PRT; 2464 AA. 

AC P14873; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Microtubule-associated protein 1B (MAP 1B) (MAP1.2) (MAP1(X)) 

DE [Contains: MAP1 light chain LC1]. 

GN MAP1B OR MTAP1B OR MTAP5. 

OS Mus musculus (Mouse). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A., AND DOMAIN. 

RC STRAIN=Swiss Webster; TISSUE=Brain; 

RX MEDLINE=90094539; PubMed=2480963; 

RA Noble M., Lewis S.A., Cowan N.J.; 

RT "The microtubule binding domain of microtubule-associated protein 

RT MAP1B contains a repeated sequence motif unrelated to that of MAP2 

RT and tau."; 

RL J. Cell Biol. 109:3367-3376(1989). 

CC -!- FUNCTION: The function of brain MAPs is essentially unknown. 

CC Phosphorylated MAP1B may play a role in the cytoskeletal changes 



CC that accompany neurite extension. Possibly MAP1B binds to at least 

CC two tubulin subunits in the polymer, and this bridging of subunits 

CC might be involved in nucleating microtubule polymerization and in 

CC stabilizing microtubules. 

CC -!- SUBUNIT: 3 different light chains, LC1, LC2 and LC3, can associate 
CC with MAP1A and MAP1B proteins. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 

CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP1B to microtubules. 

CC -!- PTM: LC1 is coexpressed with MAP1B. It is a polypeptide generated 

CC from MAP1B by proteolytic processing. It is free to associate with 

CC both MAP1A and MAP1B. It interacts with the amino-terminal region 

CC of MAP1B. 

CC -!- SIMILARITY: TO MAP1A. 

CC -------------------------------------------------------------------------- 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage, by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; X51396; CAA35761.1; -. 

DR PIR; S07549; QRMSPl. 

DR MGD; MGI:1306778; Mtap1b. 

DR GO; GO:0016358; P:dendrite morphogenesis; IMP. 

DR GO; GO:0001578; P:microtubule bundling; IMP. 

DR InterPro; IPR000102; MAP1B_neuraxin. 

DR Pfam; PF00414; MAP1B_neuraxin; 10. 

DR PROSITE; PS00230; MAP1B_NEURAXIN; 7. 

KW Microtubule; Repeat; Phosphorylation. 
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MAPI LIGHT CHAIN LCI. 


FT   REPEAT     1874   1890       MAP1B 1.
FT   REPEAT     1891   1907       MAP1B 2.
FT   REPEAT     1908   1924       MAP1B 3.
FT   REPEAT     1925   1941       MAP1B 4.
FT   REPEAT     1942   1958       MAP1B 5.
FT   REPEAT     1959   1975       MAP1B 6.
FT   REPEAT     1993   2009       MAP1B 7.
FT   REPEAT     2010   2026       MAP1B 8.
FT   REPEAT     2027   2043       MAP1B 9.
FT   REPEAT     2044   2060       MAP1B 10.
FT   DOMAIN      589    787       LYS-RICH (HIGHLY BASIC, CONTAINS MANY
FT                                KKEE AND KKEI/V REPEATS).


SQ   SEQUENCE   2464 AA;  270408 MW;  FBD3DD99CFDBDA87 CRC64;


Query Match             4.9%;  Score 284;  DB 1;  Length 2464;
Best Local Similarity   20.7%;  Pred. No. 0.0025;



Matches 233; Conservative 159; Mismatches 424; Indels 312; Gaps 51; 
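
The counts on the line above are consistent with the reported Best Local Similarity of 20.7% if that figure is taken as matches divided by the total number of aligned columns (matches + conservative + mismatches + indels). That reading is inferred from the numbers in this result only, not from a documented definition of the search program; the function name below is hypothetical. A minimal sketch of the check:

```python
def local_similarity_percent(matches, conservative, mismatches, indels):
    """Percent identity over all aligned columns (assumed definition)."""
    aligned_columns = matches + conservative + mismatches + indels
    if aligned_columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * matches / aligned_columns


# Counts copied from the summary line of RESULT 15.
pct = local_similarity_percent(matches=233, conservative=159,
                               mismatches=424, indels=312)
print(f"Best Local Similarity ~ {pct:.1f}%")
```
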

Qy 31 EPEDEEDEEEEEDEEED DEDLEELE VLERKPAAG LSAAAVP 71 

I I I : I I I I I : I I : I I I I : • II I : I 

Db 1009 EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVMAVADKAAEAGVTEEQYGYLGTSAKQ 1068 

Qy 72 PAAAAPLLDFSS D S VP PAP RG P L P AAP P AAP E RQ P SWERSPAAPA 116 

I : I : : I : : : I I : III I : I 



Db 1069 PGIQSPSREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEISSEPT 1128 

Qy 117 P SLPPAAAVLPSKLPEDDEPPARP PPPPPAGASPLAEPA-- 155 

I I : I : I I : : I I : I : I : 

Db 1129 PMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPAVASFNGLSEGSKT 1188 

Qy 156 AP P S T PAAP KRRGS GS VD ET L FAL PA ASEPVIPSSAEKIMDLMEQ 200 

I I I : I I : : I I : I : I : : I II : I : : 

Db 1189 DAT DGKD YNAS AST I S P P SSMEEDKFSKSALRDAYCSEEKELKASAE — LDIKDV 1241 

Qy 201 PGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEA 260 

: I I I I I I I I I I : I 
Db 1242 SDERLS PAKSPSLSP SPPSPIEKT 1265 

Qy 261 SKELPERATNPFVNRDLAEFS ELEYSEMGSSFKGSPKGESAI LVEN — TKEEVIVR 314 

I I I : I ,11 | : : | | : I | | : : | : : | | : 

Db 1266 — PLGERSVN FSLTPNEIKVSAEGEARSVSPGVTQAWEEHCASPEEKTLE 1314 

Qy 315 SKDKEDLVCSAALHSP-QES PVGKEDRWS PEKTMDI FNEMQMS WA- PVREEYADFK — 370 

I : I I : I : I I : I I : : I": : III I : : : I 

Db 1315 WSPSQSVTGSAGHTPYYQSP TDEKSSHLPTEVSENAQAVPVSFEFSEAKDE 1366 

Qy 371 P FEQAWEVKDT YEGS RDVLAARANVES KVDRKCLED — S LEQKS LGKDS 417 

I : : II: I I : : I I ■ I : I I I : I 

Db 1367 NERASLSPMDE— PVPDSESPVEKVLSPLRSPPLLGSESPYEDFLSADSKVLGRRSESPF 1424 

Qy 418 EGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT 470 

I I : I II Ml: : : I : I : I I III 

Db 1425 EGKNGKQGFPDRESPVSDLT STGLYQDKQEEKSTGFIPIKEDFGPEKKTSDVETMS 1480 

Qy 471 DEKKI EERKAQIITEKTSPKTSNPFLVAVQDSEAD 505 

I I : I : I : I I I I : : I I : I 

Db 1481 SQSALALDERKLGGDVS PTQIDVSQFGS FKEDTKMS I SEGTVSDKSATPVDEGV--AEDT 1538 

Qy 506 YVTTDTLSKVTEAAV — SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAI 563 

I : : : I : I : I I : I I III : : I : I : : I : I I I 

Db 1539 YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 1597 

Qy 564 QES-LYPTAQLCP SFEEAEATP SPVLPDIVMEAPLNSLL — 601 

I I : : I : : I I I : : I I : : I : I : I 

Db 1598 QETEMSPSKEECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSFAMD 1657 

Qy 602 PSAGASWQPSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVA 644 

I : II I I : : : III I I : I I I : : : : 

Db 1658 FSRQSPDHPTLGASVLHITEN GPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLS 1714 

Qy 645 LKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKS 704 

: : : I : : I I I I : : I I : I I : : I 
Db 1715 -ELISVSQVEASPSTSSAHTPS QIASPLQEDTLSDWPPREMSLYASLA 1762 

Qy 705 VPEHAELVEDS S PESEPVDLFSDDS I PEVPQTQEEAVML MKESLTEVSETVAQHKE 760 

: I I : III: III II I : I : I : I I I I 

Db 1763 SEKVQ — SLEGEKLSPKSDIS PLTPRESSPLYSPGFSDSTSAAKETAAAH-- 1810 

Qy 761 ERLSASP QELGKPY LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFN 810 

: I : I I : I I : : I : I : I : .: : : I : : | | 

Db 1811 -QAS S S PPIDAATAEPYGFRS SMLFDTMQHHLALNRDLTT S SV EKDSGGKTPGDFN 1865 



Qy 811 TAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPK LAKEYTDLE 866 

I : : | : : : : I |. I I : : : I I : I 

Db 1866 YAYQKPENAAGSPDEEDYDYE — SQEKTIRTHDWRYYYEKTERTIKSPCDSGYSYETIE 1923 

Qy 867 VSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKAS 920 

: I : : I I : : : I : I MM: M 

Db 1924 KTTKTP -EDGG . — YTCEITEKTTRTPEEGGYSYEISEKTTRTPEVSGYTYEK 1972 

Qy 921 ISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRS 962 

: : : M : I M : I M II I 

Db 1973 TERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESESYS 2020 



Search completed: September 3, 2004, 16:06:02 
Job time : 25.4629 secs 



